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On Rearrangeable Three-Stage 


Connecting Networks 


By V. E. BENES 
(Manuscript received December 26, 1961) 


A class of three-stage connecting networks proven rearrangeable by D. 
Slepian is considered. Bounds on the number of calls that must be moved are 
obtained by some simple new methods. 


I. INTRODUCTION 


Most communications systems contain a connecting network as a basic 
functional unit. A connecting network is an arrangement of switches and 
transmission links through which certain terminals can be connected 
together in many combinations. 

The calls in progress in a connecting network do not usually arise in a 
predetermined time sequence. Requests for connection (new calls) and 
terminations of connection (hangups) occur more or less at random. For 
this reason the performance of a connecting network when subjected to 
random traffic is used as a figure of merit. This performance is measured, 
for example, by the fraction of requested connections that cannot be 
completed, or the probability of blocking. 

The performance of a connecting network for a given level of offered 
traffic is determined largely by its configuration or structure. This structure 
may be described by stating what terminals have a switch placed be- 
tween them, and can be connected together by closing the switch. The 
structure of a connecting network determines what combinations of 
terminals can be connected together simultaneously. If this structure is 
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too simple, only a few calls can be in progress at the same time; if the 
structure is extensive and complex, it may indeed provide for many large 
groups of simultaneous calls in progress, but the network itself may be 
expensive to build and difficult to control. 

The structure of a connecting network also gives rise to various purely 
combinatory properties that are useful in assessing performance. For 
example, C. Clos! has exhibited a whole class of connecting networks that 
are nonblocking: no matter in what state the network may be, it is always 
possible to connect together an idle pair of terminals without disturbing 
calls already in progress. We call such a network nonblocking in the strict 
sense, because it has no blocking states whatever. 

If a connecting network does have blocking states, it is nevertheless 
possible that by suitably choosing routes for new calls one can confine 
the trajectory of the operating system to nonblocking states. That is, 
there may exist a rule whose use in putting up new calls results in avoid- 
ing all the blocking states, so that the system is effectively nonblocking. 
The rule only affects new calls that could be put into the network in more 
than one way; no call already in progress is to be disturbed. Connecting 
networks for which such a rule exists we call nonblocking in the wide sense. 

In practice, the procedure of routing the calls through the network is 
called “packing” (the calls), and the method used to choose the routes is 
called a ‘packing rule.’”’ The use of the word ‘‘packing”’ in this context 
was undoubtedly suggested by a natural analogy with packing objects 
in a container. Virtually nothing rigorous is known about the effect of 
packing rules on network performance. 

Finally, a connecting network may or may not have the property of 
being rearrangeable: given any set of calls in progress and any pair of idle 
terminals, the existing calls can be reassigned new routes (if necessary) 
so as to make it possible to connect the idle pair. 

These three combinatory properties of connecting networks have been 
given general topological characterizations in a previous paper.? In this 
paper we consider the last property described, that of rearrangeability, 
and we study the extent to which it applies to a specific class of connect- 
ing networks. 

Fig. 1 shows a typical member of an interesting and useful class of 
connecting networks that has been suggested and studied by C. Clos.! 
We refer to this class as that of three-stage Clos networks. Such a network 
consists of two symmetrical outside stages of rectangular switches, with 
an inner stage of square switches. It is completely determined by the 
integer parameters m, n, r that give the switch dimensions. In one of the 
few outstanding contributions to the theory of connecting networks, 
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Fig. 1 — Three-stage Clos network N(m,n,r). 


Clos! showed that for m 2 2n — 1 the network is nonblocking in the 


strict sense. The network defined by the parameters m, n, r will be de- 
noted by N(m, n, r). 


II. SUMMARY 


The following two known results about rearranging three-stage Clos 
connecting networks are discussed: 

t. The Slepian-Duguid theorem, which states that the network 
N(m, n, r) is rearrangeable if and only if m 2 n. 

ai. The theorem of M. C. Paull, which states that if m = n = r, then 
at most n — 1 existing calls need be moved in N(n, n, n) in order to con- 
nect an idle terminal pair. 

The principal new result proven is a generalization (and possible im- 
provement) of Paull’s bound in (zz) for any m,n, r with m 2 n tor — 1. 

The Slepian-Duguid theorem is proved in Section III by an inductive 
method due to Duguid* depending on the combinatory theorem of P. 
Hall on distinct representatives of subsets. We discuss Paull’s theorem 
in Section IV, but defer our simple proof of it to Section VI, which 
presents simple inductive proofs of various bounds on the number of calls 
that must be moved. All the proofs to be given depend on a “canonical 
reduction” procedure that consists in removing a middle switch from the 
network and reducing the parameters m and n by unity. 


Il. THE SLEPIAN-DUGUID THEOREM 


The present paper is devoted to studying the property of rearrangea- 
bility for three-stage Clos networks. We shall particularly be concerned 
first with the possibility of rearranging calls, and later with the number 
of calls that must be moved. Strictly nonblocking Clos networks will not 
be considered except incidentally, in view of Clos’s own definitive study 
of them.! 
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Our first result is due essentially to D. Slepian,! and is 

Theorem 1 (Slepian-Duguid): Every three-stage Clos network withm = n 
ts rearrangeable. 

Proof: The proof to be given is due to A. M. Duguid.’* Slepian’s proof 
was stated for the case m = n = r, but actually gave an explicit procedure 
for rearranging the existing calls so that the additional desired call could 
be put up. He showed for this case that at most 2n — 2 calls must be dis- 
turbed. This bound was subsequently improved to n — 1 by M. C. 
Paull.’ (See Section IV.) 

Duguid’s proof depends on a combinatorial theorem of P. Hall, which 
has recently come into prominence in studies of maximal flows in net- 
works. (See D. Gale.§) 

Hall’s Theorem: Let A be any set, and let A, , A2,--- , A, be any r sub- 
sets of A. A necessary and sufficient condition that there exist a set of distinct 
representatives a1, °°: ,@, 0f Ai1,°::, A,r, t.€., elements a,,--: ,a,of A 
such that 

a,eA; t=1,---,r 
a; a; for j ¥7%, 
is that for each k in the range 1 S k S 1, the union of any k of the sets 
A,,°°:, Ar have at least k elements. 

The condition given is obviously necessary. The interest of the 
theorem, and our application of it, concern the sufficiency. 

We proceed now to the proof of Theorem 1. It is obviously sufficient 
to consider only the case m = n. Let the inlets to the network be denoted 
by wu1,°-:, uw, where N = nr, and let the outlets be denoted by 
v1, °°* , Uy. It is sufficient to prove that every maximal assignment of 
inlets to outlets can be realized by a state of the network. Here ‘‘maxi- 
mal’’ means that each inlet is to be connected to exactly one outlet, and 
vice versa. Such a maximal assignment is obviously equivalent to a per- 
mutation on N objects. We let {7 — (2), 7 = 1,---, N} be such a 
permutation; also we denote the jth inlet switch by J; and the jth outlet 
switch by O;. It is convenient to think of J; as the set of ¢ for which u; 
is on the jth inlet switch, and of O; as the set of 7 for which v; is on the 
ath outlet switch. 

Let K be the set of integers {1, 2,--- , n}. We define the subsets 
{K;,t = 1, --- , n} of K by the condition 

K; = {7 |\ae(m) «0; forsome meI;}. 
*In a private communication from J. H. Déjean, the author has learned that 


Theorem 1 was also proved by J. LeCorre in an unpublished memorandum dated 
1959. 
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Now let Jia), --- , Zig) be any k of the inlet switches, and set 
K 
= U Kigy * 
j=l 


Suppose that there are ¢ distinct elements in 7’. Then all the kr inlets in 
the set 


k 
U ici) 
jal 


are assigned by z(-) to outlets from ¢ of the outlet switches, that is, to 
outlets from a set of tr outlets. But two distinct inlets are not assigned 
to one outlet, sot 2 k. Thus any union of k sets among the K; contains 
at least k elements. 

Hence by Hall’s Theorem there is a set of distinct representatives 
{k(),7 = 1,---,n} with 


k(t) ¢ K; t= 1e-yn 
k(i) 4 k(j) for ij. 


Since K contains n elements, it follows that {7 — k(7),7 = 1, --- , n} is 
a permutation. However, the interpretation of the fact that k(z) « K; 1s 
that 


a(m) € Ox) forsome meTl;. 


In other words, to every inlet switch J; there corresponds a unique outlet 
switch O; ;) such that +(-) maps some inlet on J; into some outlet on 
Oxi) . That is, there is a subassignment of w(-) that involves exactly one 
terminal on every inlet and outlet switch. 

It is evident that such a subassignment can always be satisfied on a 
single middle switch (Fig. 1), say that numbered 1. If this subassignment 
is completed, that one switch is filled to capacity, and the rest of the 
network is essentially N(m — 1,n — 1,71), ie., that of Fig. 1 with the 
parameters m, n reduced by unity. 

The theorem is clearly true form = n = 1. As an hypothesis of induc- 
tion assume that it is true for a given value of m — 1 (= n — 1). The 
argument given above proves that it is then also true for m (= n), for 
the induction hypothesis implies that the remainder of the assignment 
a(-) that was not put up on the first switch is satisfiable in the subnet- 
work, i.e., essentially in N@m — 1, n — 1, r). Hence x(-) is realizable, 
and the theorem follows by induction on n. 
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IV. THE NUMBER OF CALLS THAT MUST BE MOVED: PAULL’S THEOREM 


In view of the result of Slepian and Duguid that every three-stage 
Clos network with m = n is rearrangeable, it is natural to ask, for a given 
state x of such a network, how many calls of x need actually be changed 
to new routes in order to put in a given call between idle terminals. 
Slepian’s original procedure was for the case m = n = r, and gave the 
upper bound 2n — 2 (uniformly for all states) to the number of calls 
that must be disturbed. That is, he showed that if m = n = 1, then at 
most 2n — 2 calls need be rearranged. By a similar but more complicated 
method, M. C. Paull halved this bound, proving 

Theorem 2: Let N(n, n,n) be a three-stage Clos network with m = n = r. 
Let x be an arbitrary state of this network. The largest number of calls in 
progress in x that must be rerouted in order to connect an idle patr of terminals 
ts n — 1; there exist states which achieve this bound. 

Since Paull’s proof was involved, we have looked for and found simpler 
ways of proving and extending his result. In Section VI we give a simple 
inductive proof; the argument to be given, of course, also provides a 
proof of the Slepian-Duguid theorem not depending on the Hall com- 
binatorial result used in Section III. 


V. SOME FORMAL PRELIMINARIES 


In order to state and prove the rest of our results, it is useful, and in- 
deed necessary, to introduce a systematic notation. Such a notation has 
been described and used in a previous paper? by the author; the notation 
to be used is a consistent extension of this. 

The set of inlets of a network is denoted by J, and that of outlets by 
Q. The set of possible states of a connecting network is denoted by S. 
For a three-stage Clos network, S consists of all the ways of connecting 
a set of inlets to as many outlets by disjoint chains (paths) through an 
inlet switch, a middle switch, and an outlet switch. (See Fig. 1.) States 
of the network may then be thought of as sets of such chains. Variables 
X,Y, 2, °°: , at the end of the alphabet, range over states from S. 

A terminal pair (u, v) e J X Q (with u an inlet and v an outlet) is called 
idle in state x if neither u nor v is an endpoint of a chain belonging to x. 
A call c is a unit subset c = {(u, v)} C I XQ; ¢ isnewin a state x if 
(u, v) is idle in x. The assignment y(x) realized by x is the union of all 
calls c = {(u, v)} such that x contains a chain from wu to v. If a is an 
assignment, y~!(a) is the set of all states realizing a. The cardinality of 
a set X is denoted by | X |. The states x ¢ S are partially ordered by 
inclusion S in a natural way. 
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A distance between states can be defined as 
(x, y) 3 | cAy | ) 


= the number of calls that would have to be added, removed, 
or rerouted to change x into y, 


where A is symmetric difference. The distance of a state x from a set X 
of states is defined in the usual way as 


d(x, X) = min 6(a, y). 
yeX 


A call ¢ new in a state x is blocked in z if there is no state y > x such 
that y(y) = y(x) Uc. Astate x is nonblocking if no call new in z is blocked 
in «. The set of nonblocking states is denoted by B’. For any call c, the 
set of states x in which c is both new and not blocked is designated B,’. 

For a three-stage Clos network N(m, n, r) with m = n we define 

gx(m, N, 7) = max  6(x, y"M(y(x) Ue)) — 1 


e new inz 


= max  6(x, y"(y(x)) N B.’) 


enew inz 


me max min d(x, y) 
cnew ing yey1(y (z)) NB,’ 


the maximum number of calls that must be re- 
routed in order to put up a call c new in x. 
We also set 


o(m, n,; r) = max g2(m, n, it). 
zeS 


In this last definition, it is assumed that S is the set of states determined 
by the parameters m, n, rin Fig. 1. 

In the notation introduced above, the Slepian-Duguid Theorem guar- 
antees that for m 2 n and ¢ new in x 


y (r(x) Uc) # 0, 
y(y(z)) NB.’ 0, 
and Paull’s Theorem may be cast as stating that 


g(n,n,n) =n — 1. 


VI. THE NUMBER OF CALLS THAT MUST BE MOVED: NEW RESULTS 


We now present some new methods for studying the number of calls 
that must be moved; these yield extensions of results of D. Slepian* and 
M. C. Paull.é 
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Theorem 8: o(2, 2, r) S 2r — 2. 

Proof: Suppose that a blocked new call between input switch J;, and 
output switch O,; is to be put in when the network is in a state x. Con- 
sider any sequence ¢,, --- , c; of existing calls of « with the properties 

a. Hither c, is on J; , c, and cz are the same outlet switch, --- , 


c; and ¢;4; are on the same outlet switch, 7odd, 7<k 
c; and ¢;,; are on the same inlet switch, aeven, 2 <k, 


or ¢; is on O; , c, and c are on the same inlet switch, --- , 


c; and ¢;4; are on the same inlet switch, zodd, it<k 
c; and C;41 are on the same outlet switch, zdeven, 7 <k. 


1. Ce is the only call on some outer switch. Since neither J; nor O, is 
full, the largest k for which such a sequence exists is 2r — 2. The reader 
can verify that a possible strategy for rearranging existing calls of x so 
as to put in an J,-O, call is to take each call of the sequence c,, --- , cz 
and reverse its route, i.e., make it go through the other middle switch 
than the one it presently uses. Thus for all 


o(2, 2,7) 2-2. 


Let x be a state of N(m, n, r), and let M be a particular middle switch. 

A canonical reduction of x with respect to M will consist of 
z. removing M, 

a. on each outer switch that has a call routed via M, removing the 
link, crosspoints, and terminals associated with that call, 

wt. on: cach outer switch that has an idle link to M, removing the 
link, the crosspoints associated therewith, and one arbitrarily chosen idle 
terminal. ; 

It is easily seen that a canonical reduction of a state x of N(m, n, 1) 
leads to a state of N(m — 1, — 1,71). 

Theorem 4: g(n,n,7r) S 2r — 2. 

Proof: By Theorem 8, the result holds for n = 2, so assume it for a 
given value of n — 1 2 2, and try to rearrange a given state x of 
N(n, n, 7) so as to put in a new blocked call from J; to 0, . 

Case 1: There is a middle switch M/ with both an J; and an Q, call on it. 
Perform a canonical reduction of the state x with respect to M. This 
yields a state of N(n — 1, — 1,7), for which the result holds. 

Case 2: No middle switch has both an J, and an Q, call on it. Since the 
call to be put in is blocked, it must be true that 


# (idle links out of J1) + # (idle links out of O1) = n 
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and hence 


max{# (idle out of 7,), # (idle out of O,)} > 1. 


Suppose that # (idle out of /;) > 1. There is a middle switch M with an 
idle link to J; , and a busy link to O, . Perform a canonical reduction of 
x with respect to M, yielding a state of N(n — 1, n — 1, r) in which 
each of J; , O, still has an idle terminal. 

A refinement of this method suggested by M. C. Paull will halve the 
last two bounds. We prove 

Theorem 5: o(2,2,r) Sr—1.(r 2 2) 

Proof: The result is true for r = 2, since in that case the network has 
only one blocking state (see Fig. 2), and both blocked calls can be un- 
blocked by changing the route of one (= r — 1) existing call. 

Let us assume as an hypothesis of induction that the theorem holds 
for some value of r — 1 2 2, and in N(2, 2, r) attempt to put up a 
blocked new call c between input switch J; and output switch O, . Since 
c is new and blocked, there must be an idle and a busy link on both of J, 
and O,, and each of the busy links must pass through a different middle 
switch. Let c; be the call on J; , and c2 be the call on O, . We may suppose 
without loss of generality that c, is a call from J; to Oz , while cz is a call 
from I, to O,. 

Case 1: I; has only one call on it, viz., c.. Move cz to the other middle 
switch (see Tig. 3). 

Case 2: I. has two calls on it. Remove both c; and cz, so that J; and 0, 
become empty. Consider now the state x of the subnetwork of parameter 
r — 1 obtained by removing J, and O, and reducing the dimension of the 
two square middle switches by unity to r — 1. Each of J, and O: has at 
least one idle terminal in x, since c,; and cz were removed. Hence by the 
hypothesis of induction the subnetwork can be rearranged so as to put in 
a call from [2 to O2 while disturbing at most r — 2 existing calls. If the 
I,-O2 path thus provided is via M, then c; and cz can be replaced as in 


“Wale ed 


ee fl eerees (| eee ee 


Fig. 2 — Network with only one blocking state (r = 2). 
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OUTPUT 
O 





Fig. 3 — I; with one call, ce. 


Fig. 4. This leaves a path for the new 1-0, call c via M2 , and shows that 
it was never necessary to move c;, and that hence at most r — 1 calls 
were disturbed. If the [2-O2 path provided by rearranging the subnetwork 
is via M, , then c; and cz can be replaced as in Fig. 5. This leaves a path 
for c via M, , and shows that c, did not really have to be moved, so that 
at most r — 1 calls were disturbed. 

Theorem 6: p(n, n,r) Sr — 1. 

Proof: The result is true for n =.2. Assume that the theorem is true 
for a given value of n — 1 = 2, and seek to rearrange a state x of 
N(n, n, r) so as to put in a new call blocked in x between J; and O, . The 
theorem follows by induction on » by distinguishing two cases as in 
Theorem 4, and using a canonical reduction of 2. 

Theorem 7: Form —1 =n, 


g(m, nN, r) = e(m = i; nN, r). 


Proof: This is almost obvious. Remove any middle switch J of 
N(m, n, r) and make all terminals on which there were calls routed via 
M idle. This gives a state of N(m — 1, n, 7); in this state the desired call 


OUTPUT 





Fig. 4 — Calls c: and cz over path via M, . 
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OUTPUT 
10) 





Ce Cy 


Fig. 5 — Calls c; and cz over path via M.. 


can be put in by rearranging at most ¢(m — 1, n, r) existing calls. Now 
replace M and the calls that were routed through it. 
M. C. Paull’ has conjectured that if r = n, then 


g(m,n,n) S 2n-—1— Mm. 


This bound agrees with Theorem 2 if m = n, and with Clos’ results on 
nonblocking networks if m = 2n — 1. Paull has proved the result for 
m = 2n — 2. However, no proof of the full conjecture has been found. 
It is tempting to try the stronger conjecture that 


g(m,n,r) S$ 2n-—-1—m 


for any m, n, and r. This can be disproved by the counterexample shown 
in Fig. 6. There is no way of connecting J; to O; without moving a call 
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Fig. 6 — Network showing that J; and O; cannot be connected without moving 
a call on one of 11, Os . 
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on one of J; , O; . However, all possible alternative routes for these calls 
are pre-empted, so at least two calls must be moved. . 
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A High-Precision Direct-Reading Loss 
and Phase Measuring Set for Carrier 
Frequencies 


By J. S. ELLIOTT 


(Manuscript received December 28, 1961) 


A set has been developed to measure the parameters of insertion loss and 
phase shift of communication systems components. The design effort has 
been directed toward achieving laboratory precision in measurement and at 
the same time maintaining the ease of use and speed necessary for a large 
volume of measurements. Accuracies of +0.002 db, +0.02 degree and +1 
cycle have been attained over the frequency range from 10 to 300 ke. The en- 
tire frequency range is covered without band switching by the use of a hetero- 
dyne signal oscillator which provides frequency accuracy by locking to a 
frequency standard. The principle of phase detection is based on measure- 
ment of the time interval corresponding to the displacement of sine wave zero 
crossings caused by the unknown. This method has the advantage of good 
accuracy inherent in the measurement of time by counting techniques and 
also the ease of automatic readout of phase shift by translation from time 
units. In measuring loss, use of a rapid sampling technique to compare the 
unknown with a standard eliminates errors caused by circutt drifts. 


The ever increasing complexity of communication systems and the de- 
mand for high-quality transmission have emphasized the need for close 
control of system components such as filters, equalizers, and repeaters. 
Precise instrumentation to measure the parameters of loss and phase 
shift must be provided for use in both the development and production 
areas of these components. 

In the case of broadband carrier systems this need was met by develop- 
ment of a 3.6 mc phase measuring set.! Later, the set was modified to 
provide increases in maximum frequency from 3.6 to 20 me, loss measure- 
ment accuracy from +0.05 to +0.02 db and phase measurement ac- 
curacy from +0.25 to +0.1 degree. 

The need for instrumentation to control components in the case of 
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high-speed data systems has been met by the recent development of a 
10 to 300 ke phase measuring set. This set will automatically measure 
and read out insertion phase shift to an accuracy of 0.02 degree and will 
measure insertion loss to an accuracy of 0.002 db. 

The performance specifications of the new set are more completely 
stated as follows: 

Frequency: 10 to 300 kilocycles; maximum accuracy +1 cycle 

Generator and network termination impedance: 75 ohms unbalanced 

Test signal level: +6 dbm 

Insertion loss range: 0 to 100 db, maximum accuracy +-0.002 db 

Insertion phase shift range: 0 to 360°, maximum accuracy -+-0.02° 

The quantities measured are defined in Fig. 1. Conforming to these 
definitions, the measuring system compares phase and amplitude of the 
outputs of two transmission channels energized from the measurement 
frequency source, one of which serves as a standard channel while the 
other contains the apparatus under test. This is illustrated in the block 
diagram of I'ig. 2. 

The measuring circuit uses the heterodyne principle,’ which provides 
the high degree of frequency discrimination required for precision meas- 
urements and the ease of operation by self-tuning. Heterodyning also 
translates the phase of the unknown from the variable frequency to a 
constant low intermediate frequency at which phase shift can be ac- 
curately detected. The principle of phase detection is based on measure- 
ment of the time interval corresponding to the displacement of a sine- 
wave zero crossing caused by the unknown, as discussed in more detail 
in Section IV. This method has the advantage of good accuracy inherent 
in the measurement of time by counting techniques and also the ease of 
automatic readout of phase shift by translation from time units. 


Zz Z1 
ie a an a 
| I | | 
E 
i oF . i 
y ane y an 2 
(a) (b) 


E IN (a) IS IDENTICAL TO E IN (b) 
1 E, 

INSERTION LOSS (DB) = 20L06 |-=* 
2 


INSERTION PHASE SHIFT = ZE,— ZE, 


Fig. 1 — Definition of quantities measured. 
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INSERTION LOSS: S ADJUSTED FOR E3= Eos 
THEREFORE X= S (DB) 


INSERTION PHASE SHIFT OF S=0° 
THEREFORE LE, = ZE, (FIG. 1) 


INSERTION PHASE SHIFT OF X = ZE,—ZE3 





Fig. 2— Basic block diagram. 


Alternate connection of the loss detection circuit to the standard and 
test channels? * is made at a relatively rapid rate by the use of mercury 
relays driven at 13 cycles per second. This fast switching method results 
in minimizing errors caused by magnitude instabilities in the measure- 
ment signal source which are prohibitive in manual switching methods. 
In phase measurements, where magnitude changes are less critical, the 
alternate connection of the phase detection circuit to the two channels 
is made at a slower rate to allow for an accurate measure of time. The 
overall circuit is illustrated in two sections by the block diagrams of 
Figs. 3 and 4. 

For loss measurements a single master switch sets up the connection 
of Fig. 3 and the timed driving voltages for the switches. The signal at 
frequency F is applied through both the standard and unknown channels 
of the comparison unit, whose outputs are connected alternately to the 
input of a wideband amplifier. Approximately equal detection sensitivity 
for various values of loss of an unknown is attained automatically by 
ganging the amplificr gain controls to the standard attenuator control 
switches so that the level of the signals supplied to the detector is main- 
tained constant to within 1 db. The loss detector then compares the 
magnitudes of the two signals and indicates their inequality on its meter. 
When the attenuator is adjusted for zero meter reading, the loss of the 
unknown is read directly from the attenuator dials. Details of the loss de- 
tection system are given in Section ITI. 

For phase measurements, the master switch automatically connects 
the constant phase attenuator in place of the loss attenuator, connects 
the output signals of the two channels to the phase detector in place of 
the loss detector, as shown in Fig. 4, and changes the timing of the circuit 
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Fig. 3 — Block diagram of loss measurement circuits. 
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relays. The difference in phase of the two signals is now detected and 
indicated on an in-line readout. The constant phase attenuator is com- 
pensated so that at any setting it introduces phase shift in the standard 
arm exactly equal to the phase shift in the unknown arm for “‘zero”’ set- 
ting, i.e., with a patch cord inserted in place of the unknown. The indi- 
cated phase is thereby the phase shift due to insertion of the unknown. 
Ganging of the constant phase attenuator to the loss attenuator auto- 
matically maintains the magnitude equality of the loss measurement and 
thereby eliminates possible errors in phase detection due to unequal 
signal magnitudes. The details of the phase-detection system are also 
described below. 


I. MEASURING SET DESIGN 


The component chassis of the set are mounted in a console assembly 
of three relay rack cabinets as shown in Fig. 5. Particular emphasis has 
been placed on the arrangement of the chassis in the cabinets so that the 
controls of oscillators F'1 and I'4 and of the comparison units, which are 
the ones that are operated in the normal measurement procedure, are 
within easy reach of a seated operator. The loss balance, phase, and level 
indicators are also located for ease of viewing. Accessibility to chassis 
maintenance test points and to components for replacement is provided 
by mounting the most critical chassis on sliding and tilting chassis tracks. 
Access to rigidly mounted chassis is permitted through rear cabinet 
doors and in some instances through hinged front chassis covers. 

The function of the individual units in relation to the overall circuit 
and some of the significant design considerations are discussed under the 
following headings: 


II. STANDARD CONTROLLED HETERODYNE OSCILLATOR (SCHO) 
III. LOSS DETECTOR 

IV. PHASE DETECTOR 

V. REVERSIBLE TIMER 

VI. COMPARISON UNIT 


II. STANDARD CONTROLLED HETERODYNE OSCILLATOR (SCHO) 


The SCHO is the most complex part of the measuring set and includes 
seven separate units. Six units occupy the complete upper part of the 
left-hand bay and one unit is at the top of the central bay as shown in 
Fig. 5 
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Fig. 5 — The assembled loss and phase measuring set. 
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It is found convenient to treat the discussion of the SCHO under the 
following sub-headings: 


2.1 General Description and Requirements of the SCHO 
2.2 Functions of Automatic Frequency Control (AFC) Loops 
2.3 Design Considerations of AFC Loops 


2.1 General Description and Requirements of the SCHO 


The need to accurately measure the characteristics of networks having 
steep transmission and phase slopes prescribes stringent frequency ac- 
curacy requirements on the oscillator. To obtain an accuracy of +-0.002 
db in the presence of a 2 db per kilocycle slope, a frequency accuracy of 
+1 cps must be maintained. To apply the heterodyne principle, the os- 
cillator must provide dual output frequencies: the test frequency and a 
slave or offset frequency equal to the test frequency minus a constant (A). 

The design of the oscillator for this set is based on the principles of a 
high frequency standard controlled heterodyne oscillator (SCHO).* 
While the SCHO is a complex structure, the setting of the test signal to 
any desired frequency in the range of 10 to 300 ke accurate to +1 cycle 
is a simple process. The film scale is rotated to the 1 ke mark below the 
desired frequency and the dials of the four decades of the interpolation 
oscillator are set to the remaining digits of the desired frequency. 

Essentially the oscillator employs local oscillators that can be set to 
produce the desired test and offset frequencies. These are then auto- 
matically and continuously maintained to the desired precision by ref- 
erence to three standard frequency sources, two internal and one external, 
as shown in Fig. 6. The output frequencies, F and F — A, are generated by 
a variable oscillator, F1, and two controlled oscillators, F3 and F5. F 
is the beat between Fl and F'3 while F — Ais the beat between F1 and F5. 
These beats are produced in modulator amplifiers 1 and 2 respectively. 

As explained below in more detail, the frequency of F is controlled to 
a multiple of the external 1 ke standard through an intermediate con- 
trolled oscillator, F2. The frequency of F is then shifted to the required 
exact frequency by a variable interpolation oscillator, F4. Finally, the 
offset frequency is produced by fixed oscillators, F6 or F6’, of the ap- 
propriate A frequency which control the frequency difference between 
F3 and F5. 

Fl is a stable tuned-grid oscillator? with a film scale 100 inches in 
length calibrated in terms of F at every 10 ke and further subdivided 
every 1 ke. 

F2, F3 and F5 are tuned-grid reactance tube! controlled oscillators. 
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Fig. 6 — Block diagram of standard controlled heterodyne oscillator (SCHO). 
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I*4 is a precision RC decade oscillator. 

Oscillator I'6 is a tuning fork oscillator, while F6’ is a crystal type. The 
need for the use of the two offset or A frequencies is discussed in Sections 
III and IV. 


2.2 Functions of Automatic Frequency Control (AFC) Loops 


The discrete control loop I provides voltage to control the frequency 
of oscillator F2 by using the 1 ke standard as a reference. I2 is controlled 
so that the difference F2 — Fl is an exact multiple of 1 ke. As F1 is set 
to within +200 eps of any 1 ke multiple, as indicated by its calibrated 
film scale, 2 is shifted in frequency and phase locked to F1 by loop I 
as required. 

The interpolation control loop II provides voltage to control the fre- 
quency of oscillator I°3 by using the interpolation oscillator F4 as a ref- 
erence. I°3 is controlled so that the difference (3 — F1) is the chosen 1 
ke multiple plus an amount 0 to 1 ke determined by the dial settings of 
oscillator F4. 

The offset control loop III provides voltage to control the frequency 
of oscillator 5 by using oscillator 6 or F6’ as a reference. F5 is con- 
trolled so that the difference F3 — I5 is equal to the frequency, A, of 
the chosen reference oscillators: F6 = 277.78 cps or F6’ = 2500 eps. 
Since the offset output is F5 — F1, it is equal to (F3 — A) — (F3 — F) 
or fF — A. 


2.3 Design Considerations of AFC Loops 


Modulation requirements of the SCHO oscillator are severe for both 
loss and phase measurements. For loss measurement errors not to exceed 
0.001 db, amplitude modulation must be held to —80 db. Frequency 
modulation due to the operation of loop III introduces errors in phase 
measurement in a more complex manner. Addition of sideband frequen- 
cies to the F and F' — 277.78 cycle signals fed to the modulators of the 
phase detector (block diagram in Fig. 7) will introduce a spurious 277.78 
cycle signal in the modulator outputs. This spurious signal produces 
error by shifting the zero crossings, the error reaching a maximum for a 
90° phase shift of the spurious component. To meet the circuit objective 
of 0.01 degree, the level of this interference must be 75 db down. 

One source of this interference may be by frequency modulation of F5 
of Fig. 6 due to insufficient filtering in loop III. This would allow trans- 
mission of frequencies A (277.78 cps) and 2A (555.56 cps) from the phase 
detector of loop ITI, Fig. 6, to the grid of the reactance tube that controls 
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oscillator 5. The maximum amplitude of this modulating voltage (vm) 
must be less than 40 microvolts as shown in the Appendix. This low value 
prescribes complete, high quality shielding between the components of 
the frequency control circuitry of loop III and adequate filtering in the 
loop without introducing excessive delay. 

From the description of loop functions, it will be seen that 5 must 
change in frequency by at least 1.5 ke, since it must compensate for fre- 
quency drifts of the order of 0.5 ke in F1, F2 and F8, as well as cover the 
1 ke range of F'4. In addition to this wide control range, the offset loop 
III must maintain the phase difference between the test and offset sig- 
nals to within 0.01 degree during the phase measurement time interval 
to meet the objective of circuit accuracy. 

The loop includes both frequency discriminating and phase detecting 
circuits and the reactance tube control of F5. The desired control was 
attained by careful design of the following factors: (a) the Q and thereby 
the frequency slope of the discriminator is made as high as possible with 
readily available components; (b) the reactance tube is operated at the 
point in its characteristic which produces maximum change in its output 
capacitance with a minimum change in grid voltage supplied by the 
phase detector; and (c) added gain is kept at a minimum to prevent loop 
sing but still sufficient to maintain control. 


III. LOSS DETECTOR 


The loss detector circuit is illustrated by the block diagram in Fig. 3. 
The detector is a self-tuned null balance type which produces an output 
proportional to the difference in loss of the standard and unknown 
branches of the comparison unit in Fig. 3 with an unbalance sensitivity 
of 0.001 db. The operation of the detector circuit will be described first 
and then the factors governing the choice of constants will be considered. 


3.1 Detector Operation 


The envelope of the test signal varies rectangularly in level (a of Fig. 3) 
as the comparison switch alternately connects the signal from the loss 
attenuator and unknown branches of the comparison circuit to the loss 
detector. The variable frequency signals are translated to a 2500 cps 
fixed intermediate frequency (b of Fig. 3), amplified, and detected with 
a parabolic detector. The output of the parabolic detector contains a 
rectangular component (c of Fig. 3), corresponding to the rectangular 
envelope of the carrier, and the much larger carricr component which is 
minimized by filtering. The 2500 cps carrier component is about 80 db 
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greater than the rectangular component produced by an 0.001 db in- 
equality of the S and X signals. 

Transients of variable magnitude are superimposed on the rectangular 
signal (c of Fig. 3) when the comparison switch is closed. Therefore, for 
true balance information, the output of the filter is sampled for a very 
short interval (d of Fig. 3) near the end of the dwell time of the compari- 
son switch on each side. The capacitor Cg in the sampling circuit now 
contains a rectangular component? ? (e of Fig. 3) at the switching fre- 
quency of 13 cycles. The amplitude of this component is proportional to 
the amount of unbalance between the standard and unknown paths, and 
the polarity with respect to the mean value is dependent on the sense of 
unbalance. This signal is then amplified, (f of Fig. 3) rectified, and the 
rectified voltage applied to a zero center de meter. The meter deflection 
is now proportional to the loss unbalance and its direction from mid-scale 
indicates which of the two signals is the larger and thereby indicates 
whether the loss of the standard must be increased or decreased to attain 
balance of the signals. The balance indicator provides a relatively con- 
stant deflection for the same loss unbalance in db, independent of the 
absolute magnitude of loss, by maintaining the level of the input to the 
detector relatively constant as discussed in Section VI. 


3.2 Detector Constants and Design Considerations 


The choice of intermediate frequency, bandwidth, switching frequency 
and measurement period is based on the following considerations. The 
demodulator is a square-law rectifier detector followed by a low pass 
filter which must (1) transmit a rectangular envelope component, (2) 
provide a high attenuation to the intermediate frequency, and (8) have 
a rapidly decaying transient response. 

The filter design problem becomes quite difficult for low values of in- 
termediate frequency, so that a high frequency is desirable. However, if 
a high frequency is chosen, its low order harmonics may fall within the 
measurement band of 10 to 300 ke. It is difficult to provide sufficient iso- 
lation to prevent pickup of these harmonics from the IF amplifiers of 
the tuner through parasitic paths, such as power supplies or common 
grounds, back to the wide band amplifiers of the comparison unit or to 
units of the SCHO. Such pickup would result in measurement errors 
since wanted and parasitic signals could add in one position of the switch 
S1A of the comparison unit and subtract in the other position. An inter- 
mediate frequency of 2.5 ke was selected as a compromise value. Since 
only fourth and higher harmonics of 2.5 ke are in the measurement band, 
sufficient filtering to remove interference is less difficult. 
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The choice of bandwidth of the intermediate frequency is a compromise 
between (1) a narrow bandwidth for a high degree of frequency dis- 
crimination and thereby a high signal-to-noise ratio for good circuit 
accuracy and (2) a wide bandwidth to allow a fast decay of switching 
transients and thereby minimize their effect on circuit accuracy. To limit 
measurement noise below 0.001 db, a signal-to-noise ratio of 80 db is 
required. Bandwidth was fixed at 600 eps, which resulted in an 86 db 
ratio at point b of Figure 3 and a decay time of approximately 20 milli- 
seconds. 

The choice of switching rate of 13 cps is a compromise between (1) 
fast speed to minimize level changes of the signal source during compari- 
son of the two paths and (2) slow speed to allow ample time for transient 
decay and to relax the accuracy requirement of the sampling timing in 
relation to the comparison time. In addition, the 13 cps rate was chosen 
to minimize the effect of pickup that could occur at a submultiple of the 
60 eps power frequency. 


IV. PHASE DETECTOR 


It is convenient to divide the discussion of the phase detector into four 
categories: (a) General description of the principle of phase detection by 
measurement of time intervals, (b) Operation of the phase detector, (c) 
Timing of circuit measuring periods, (d) Design considerations and 
choice of constants. 


4.1 General Description 


As stated in the introduction, the choice of phase detection by the 
measurement of the time interval corresponding to the displacement of 
a sine wave zero crossing was based on the inherent accuracy of time 
measurement and the ease of automatic readout of phase by translation 
from time units. The circuit of the detector is illustrated by the block 
schematic of Fig. 7. The detector measures the phase difference between 
the test and reference signals of Fig. 4. 

The problem of phase measurement over a wide frequency range (see 
Tig. 7) is simplified to the measurement at a fixed frequency by the 
preservation of phase in a modulation process. The considerations in the 
choice of 277.78 cps as the fixed frequency and the choice of a 1 mega- 
cycle counting rate will be discussed later. 

The time interval, beginning with the negative-going zero crossing of 
the reference signal and ending with the corresponding negative-going 
zero crossing of the test signal, is proportional to the phase difference 
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between those two channels. Since the time for 360 degree difference 
equals 1/277.78 second, then 1 microsecond = 0.1 degree and, at a 1 
megacyle counting rate, 1 count = 0.1 degree. 

Circuit zero errors, due to frequency-variable phase difference between 
the unknown and reference paths of Fig. 4, are eliminated by the switch- 
ing of 81 of Fig. 4. The circuit compares the phases of the unknown and 
reference paths and then compares the phases of the attenuator and ref- 
erence paths. Since the reference path is common to both comparisons, 
the resulting measurement is the difference in phase between the un- 
known and attenuator paths. 


4.2 Description of Detector Operation 


As discussed above, the detector must measure the time interval be- 
tween negative-going zero crossings of two sine waves. The time of the 
zero crossings is established by the generation of corresponding sharp 
pulses at the instant of zero crossing and the time interval between pulses 
is measured by a reversible timer which is described in Section V. 

The IF sine wave of each channel (a of Fig. 7) is clipped and amplified 
by three limiters in cascade which convert the signal to a fast-rise square 
wave (b of Fig. 7). The shock-excited triode oscillator is driven to cutoff 
by the fast negative-going square wave. Oscillations are produced in the 
tank circuit coil together with its distributed capacitance and a shunt 
diode damps out all but the first half sine wave. The resultant output 
pulses (c of Fig. 7) are 0.5 ws wide, 35 volts in magnitude and have a rise 
time of 0.2 ws. A relay in the ON pulse branch is controlled so that it 
closes approximately 0.2 second after the X-S switch of the comparison 
unit closes, thereby closing only after the switching transients have dis- 
sipated. 


4.3 Timing of Circuit Measuring Periods 


The direct reading of phase measurement is accomplished by the fol- 
lowing sequence of automatic switching as illustrated by the diagram of 
Fig. 8: 

1. The comparison switch S1A of Fig. 4 is switched to X and the timer 
is switched to ‘‘add’”’ by switch S1B. No pulses reach the timer since 
relay RL1 of Fig. 7 is open. 

2. An interval of 0.2 second is allowed for decay of transients intro- 
duced by closing S1A. 

3. Relay RL1 is closed allowing transmission of pulses to the timer. 

4, The timer measures the total time interval of 100 ON-OFF pulses 
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Fig. 8 — Time diagram of phase measurement circuit operation. 


from the detector output. The timer gate opens at the 100th OFF pulse 
and the count is stored. The maximum time (and count) is 0.36 second 
corresponding to 360 degrees as discussed above. 

5. Relay RLI is again opened. 

6. The comparison switch S1A of Fig. 4 is switched to S and the timer 
is switched to “subtract” by switch S1B. 

7. An interval of 0.2 second is allowed for decay of transients. 

8. Relay RLI1 is closed. 

9. The timer measures the total time interval of 100 ON-OFF pulses 
and subtracts it from the count stored in 4. The timer gate opens at the 
100th OIF pulse and the remaining count is stored. 

10. After 0.14 second, a relay in a digital translator is closed and the 
translator setting, corresponding to the stored count of the timer, is dis- 
played by an in-line numerical indicator. 

11. After an interval of 2 to 10 seconds, as desired, the entire procedure 
from 1 to 10 is repeated. 


4.4 Design Considerations and Choice of Constants 


To attain 0.01 degree accuracy of phase measurement, the choice of 
Il’ frequency (A) in the phase detector, counting rate, and the number of 
periods measured becomes highly critical because these factors are closely 
interrelated and also because they are subject to the performance of the 
SCHO oscillator. 


As previously discussed, minimizing errors due to FM modulation 


LOSS AND PHASE MEASURING SET 1509 


makes the design of the AFC loop III of the SCHO particularly difficult 
when A is small. However, since phase is to be measured by conversion 
to time, a large A requires a high counting frequency (or rate) F', . Prob- 
lems of circuit phase ‘‘zero,”’ as previously described in Section 4.2, 
prescribe a reversible counter which is increasingly difficult to design for 
higher rates. To obtain direct reading phase measurements in decades 
from 0 to 360 degrees, /’p must be 10” where n is an integer and A = 
F/360. Based on the above considerations, the selected constants were: 
Fr, = 1.0 me and A = 277.78 cps from which 1.0 degree = 10 microsec- 
onds. Measurement to 0.01 degree, under the ideal condition of no phase 
drift or phase jitter during the measurement time, requires the measure- 
ment of 10 periods. In practice there is some phase jitter, of the F — A 
signal in relation to the F signal, produced by the high gain of the AIC 
loop III of the SCHO which in turn is needed to control frequency over 
the prescribed range. To compensate for this jitter, 100 periods are 
measured and the last digit (0.001°) is not displayed. In effect, the dis- 
played value of phase is an average of 100 counts for measurements of 
phase. 

The IF bandwidth of the detector must be as narrow as possible to 
provide sufficient frequency discrimination and at the same time must 
be wide enough so that switching transients decay rapidly. The choice of 
30 cps bandwidth proved adequate to achieve the aim of 0.01° phase 
accuracy and permits transients to decay within 0.1 second to a suffi- 
ciently low magnitude that accuracy is not affected. As previously de- 
scribed, 0.2 second in the switching time was allowed for transient decay 
which resulted in a total measurement time of 0.926 second. As previ- 
ously stated, the phase lock of the SCHO is held to better than 0.01 de- 
gree shift in this total measurement time. 

If the phase shift being measured is less than the phase shift through 
the constant phase attenuator, the reverse count will exceed the forward 
count and the difference will be negative. For a negative number n, 
however, the timer will indicate 1000.00-n: thus, —15.87° is indicated 
as 1000.00 — 15.37° or 984.63°. To avoid the necessity for arithmetic 
subtraction by the operator, the lead-lag switch (lig. 7) has been pro- 
vided. The switch reverses the S-X and reference pulse generator inputs 
to allow the count to appear as a positive number. The operator need 
only record the indicated count with a change of sign. 

If, relative to the reference channel, the total phase shift, through 
standard or unknown path and through modulators and pulse generators, 
lies in the neighborhood of 0 or 360 degrees, the phase measuring circuit 
cannot operate properly. The difficulty is caused by the inability of the 
timer to respond to pulse pairs having a short time separation. In such 
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a case the phase circuit will either refuse to cycle or will give highly 
erratic results. Operation of the zero shift switch (Tig. 7) adds approxi- 
mately 180 degrees in the reference path. Effectively, then, 180 degrees 
is added to both S and X measurements and the difference angle is un- 
changed. 

As discussed in Section 2.3, sidebands of 277.78 cps on the signal ap- 
plied to the phase detector may cause phase measurement error. Simi- 
larly, pickup in the detector of 277.78 cps and crosstalk between the two 
channels in the detector can produce phase error. The components of 
each channel were mounted in separate shielded compartments and at 
each point of entry of a power lead into a compartment heavy bypass 
capacitors were installed. The lead-lag switch required complete shield- 
ing including shields between each switch section, grounding of inter- 
connecting wires in their alternate position and the use of shielded cables 
between the switch and the circuit connecting points. 


V. REVERSIBLE TIMER 


A block diagram of the reversible timer is shown in Fig. 4. The timer® 
measures the time difference between interval X, the time separation of 
input pulse pairs when the unknown network is in the measurement cir- 
cuit, and interval 8, the corresponding interval when the phase attenua- 
tor is in the circuit. 

Input ON and OFF pulses, appearing on the separate input leads from 
the phase detector (lig. 7), are applied to an electronic switch whose 
output is a rectangular pedestal voltage with leading and trailing edges 
coincident with the leading edges of the ON and OFF pulses. The pedes- 
tal voltage is applied to a gate which opens during the pedestal interval 
and transmits 1 me voltage to a group of six tandem reversible decades. 
Reversal of the direction of count is accomplished by the use of interstage 
gates within the decade counting units actuated from the S-X switch 
S1B of Fig. 4. An auxiliary + 100 counter controls the electronic switch 
so that exactly 100 time intervals are measured for each direction of 
count. 

The above six counting decades, measuring for 100 periods, provide a 
count to 0.001 degree phase. As described in Section 4.4, this last decade 
may jitter at random so that the output of only the first five decades are 
translated to decimal form and transmitted to an in-line readout display. 


VI. COMPARISON UNIT 


The comparison unit is designed around the parallel comparison type 
circuit shown in Figs. 3 and 4. The method of detecting the equality in 
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Fig. 9 — Comparison unit, showing the ganged level control. 
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Fig. 10 — Comparison unit, showing ganging of the loss and constant phase 
attenuators. 
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magnitude and the difference in phase of the signals from the two paths 
was discussed in Sections ITI and IV. The construction used to attain the 
circuit objectives is shown in Figs. 9 and 10 and some of the special fea- 
tures are described below. 

A constant sensitivity of 0.001 db when measuring losses in the range 
from 0 to 70 db prescribes a constant signal level to the loss detector. 
This is achieved by the use of step switch gain controls in the wideband 
amplifier, ganged by chain and sprocket to the attenuator switch shafts, 
as shown in Fig. 9. As the unknown loss and, therefore, the attenuator 
loss increases in steps of 1 db and 10 db, the gain of the amplifier is cor- 
respondingly increased over a 70 db range, thereby maintaining the level 
to the detector constant within 1 db. 

Measurement of 40 db loss to a sensitivity of 0.001 db corresponds to 
0.44 microvolt difference of the signal input to the amplifier and pre- 
scribes special precautions to minimize noise and pickup in the amplifier. 
Power frequency pickup is minimized by operating all heaters on filtered 
de and by locating power supplies well away from areas sensitive to 60 
cycle field. Microphonics are reduced by shock mounting both the mer- 
cury relay S-X switch (S1 of Fig. 3) and the input preamplifier. Thorough 
shielding and decoupling reduce high frequency crosstalk sufficiently to 
attain the desired measurement accuracy. 

To attain the circuit objectives, the standard attenuator must provide 
insertion loss in the range of 2 to 122 db adjustable to within 0.001 db 
for loss measurements and to within 0.1 db for phase measurements. The 
insertion loss must meet accuracies to nominal of +0.001 to 42 db; 
-+0.01 to 72 db and +0.5 to 122 db at frequencies from 10 to 300 ke. In 
addition, the insertion phase shift at any setting in the range of 2 to 42 
db must be equal to the insertion phase shift at any other setting in that 
range to within 0.01 degree at any frequency from 10 to 300 ke. The loss 
accuracies are met with an attenuator embodying precisely adjusted, 
stable wire-wound resistors; however, the phase shift of this type at- 
tenuator varies with setting by as much as 0.5 degree. A second attenua- 
tor is used embodying deposited carbon resistors in a semicoaxial struc- 
ture. Adjustment of internal capacitor trimmers for each step attains the 
required constant phase shift. The two attenuators are ganged, as shown 
in Fig. 10, so that the equality in magnitude of the signals from the two 
circuit arms attained in the loss measurement is held to within 0.1 db for 
phase measurement and the signal level from either arm to the phase 
detector is constant to within 1 db for loss of the unknown from 0 to 
70 db. 

Additional interlocking of the shafts of the 0 to 6 X 10 db and 0 to 
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5 X 10 db switches (2 right-hand knobs in Fig. 10) is required to avoid 
more complicated ganging to the amplifier gain controls. The interlock 
prevents moving the right-hand knob until the second switch is in the 
60 db position, thereby maintaining constant detector input level for 
losses up to 70 db. Additional amplifier gain for the higher losses added 
by the right-hand switch is not needed since sensitivity requirements are 
comparably relaxed. 


VII. ACCURACY OF MEASUREMENTS 


7.1 Loss 


The accuracy of loss measurements was verified by measuring a preci- 
sion 75 ohm attenuator of 0.001 db accuracy. The accuracy of this check- 
ing standard was established by first calibrating at de by measurement 
of voltage ratios with a precision potentiometer and then measuring the 
loss change with frequency up to 20 mc using a circuit capable of 0.01 db 
accuracy. Since for any value up to 40 db the loss change was less than 
0.1 db from de to 20 me and was linear within the measurement accuracy, 
by interpolation the loss change of the checking standard should be less 
than 0.001 db from de to 200 ke. A typical set of measurements of the 
checking standard using the 300 ke set is given in Table I. 

As shown by the table, the design objectives were met for the measure- 
ment set of 0 to 40 db +0.002 and 40 to 60 db +0.02. 


7.2 Phase 


The accuracy of phase measurements was verified by comparing phase 
shift of coaxial cable as measured on the set with values computed from 
impedance measurements made on a precision bridge. 

Western Electric 724 coaxial cable was found to have the necessary 


TABLE I 





Measured Loss (db) at 
Attenuator 
Setting (db) 

















10 ke 35 ke 105 ke 200 ke 300 kc 
14 14.001 14.000 14.000 13.999 13.999 
24 23.998 23.998 23.998 23.998 23.998 
34 34.000 33.999 33.998 33.998 33.998 
44 43.999 43.999 43.998 43.997 43.997 
54 54.001 54.001 54.000 53.998 53.996 
64 64.002 64.001 63.999 63.993 63.991 
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time and temperature stability. Open circuit and short circuit impedances 
were measured on a 1200 foot length of cable. From the bridge measure- 
ments input and output image impedances were computed using: 


Z; = Vv Loe Lee . 


Image impedance rather than characteristic or iterative impedance 
was used since the cable is not exactly symmetrical. 

Insertion loss and phase shift between image impedance terminations 
were computed as follows: 


tanh 06 = VWZseZoo = U+9V 


then the insertion loss is 


and the phase is 


oe ai oo 
B 3 tanh eel: 


To avoid mismatch effects between cable and the measurement sct, 
L type matching pads were constructed for each calibration frequency. 
The cable plus pads was measured and then the pads only with the cable 
removed. The difference in measured values then was recorded as the 
value of the cable. 

A comparison of some measured and computed values is given in 
Table II. 

Some errors inherent in bridge measurement and in constructing im- 
pedance matching pads may contribute as much as 0.01 degree of the 
above 0.03 degree maximum differences. Allowing for this uncertainty, 
the measurement accuracy of the circuit is considered to be +0.02 degree 
over the frequency range of 70 to 150 ke. Since bridge measurement ac- 
curacies at frequencies above 150 ke do not provide sufficient accuracy 
of calculated phase, further checks were made in the following manner. 











TABLE II 
Frequency (kc) Computed (Degrees) Measured (Degrees) Difference (Degrees) 
70 58.94 58.91 —0.03 
90 75.40 75.41 +0.01 
120 99.92 99.91 —0.01 
130 108.10 108.10 0.00 
140 116.26 116.25 —0.01 


150 124.30 124.33 +0.03 
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Tour lengths of cable totaling 897 feet were prepared with coaxial fit- 
tings. Five impedance isolating pads were prepared each with coaxial 
fittings such that the cables could be inserted directly between the pads 
without additional fittings. Measurements of loss and phase were made 
of the four cable sections in tandem isolated from each other and from 
the set by pads and then only the pads in tandem were measured. The 
difference then is the total loss and phase of the four cable sections. 

Each individual cable section was measured when terminated with the 
same pads that terminated the specific section when the tandem measure- 
ment was made. Again the pads only were measured, the difference giving 
the loss and phase of the individual sections. 

Values of the measured sum compared to the calculated sum are given 
in Table III. 

The above differences are derived from the individual measurements 
as outlined above and there is an uncertainty in each measurement of 
0.001 db and 0.01 degree which may add in a random manner to 0.003 db 
and 0.03 degree. Since the above calculated differences are no greater 
than the random errors, the set capability for measuring a single value 
is considered to be +0.002 db and +0.02 degree. 


VIII. CONCLUSION 


The design effort has been directed toward achieving laboratory pre- 
cision in measurement and, at the same time, maintaining the ease of use 
and speed necessary for a large volume of measurements. The entire 
frequency range of the set described in this article is covered without 
band switching by the use of a heterodyne signal oscillator which provides 
good frequency accuracy by locking to a frequency standard. The need 
for manual detector tuning is eliminated through the use of frequency 
conversion employing a locked slave (or offset) frequency source to pro- 
vide a constant detection frequency. The desired test frequency can be 
set and read directly from the markings of a film scale type master os- 
cillator and the dials of an interpolation oscillator. The insertion loss and 











TaBe II] 
150 kc 300 ke 
Loss Phase Loss Phase 
Measured........... 0.872 79.46 1.179 7 156.67 
Calculated.......... 0.875 79.44 1.178 156.64 





Difference. ......... 0.003 db 0.02 deg. 0.001 db 0.03 deg. 
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phase of an unknown may be read directly from the dials of an attenua- 
tor and from an in-line digital readout, respectively, and since the system 
zero is independent of test frequency, no circuit zero is required. 
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APPENDIX 
Limitations of Frequency Modulation in SCHO Oscillator 


The introduction of errors in phase measurement by frequency modu- 
lation of the F and F — A frequencies of Fig. 6 was discussed in Section 2.3. 
The derivation of the limitation of the amplitude of a spurious modulat- 
ing signal is as follows: 

The output of a frequency-modulated oscillator of very low modula- 
tion index (my) contains only one pair of sidebands and is well known 
(see Ref. 6 and similar texts) to be expressed as 


cm aL (Bf) sina + (2) ino + 9 ~ sino —nra] a) 


Son 
where 
Af = frequency deviation 
fm = modulating frequency 
J (2) = Bessel function of the first kind, order n. 
Af 


For 7, near zero (1) reduces to 


e= Ky (sin wl + x [sin (w + p)t — sin(w — p)Al) 
The maximum phase modulation is then: 


Ay = 2fm/Af 
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but 
Af = kum 
where 
k = reactance tube oscillator sensitivity in cps per volt 


2.3 X 10? eps/volt for 5 (Fig. 6) 


Vm = amplitude of modulating voltage 
thus 
= Jim 
ay 1.2 X 10%," 


Tor a phase error of 0.01 degree, AY = 0.01 X (2/180) radians. Hence, 
the ratio of carrier to sideband amplitude must be greater than 


te 2. 3 
Ag eS 10° or 75 db 
and for fm = 278 eps, the maximum allowable modulating voltage is 
278 _ 
Um = T2 «x 10° x 5.7 x 10° = 40 microvolts. 
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Thermoelastic Stresses in Balanced 


and Unbalanced Seals 


By T. D. RINEY and J. W. ELEK 
(Manuscript received April 28, 1961) 


This paper presents the results of a two-part analytical study of the 
stresses produced when ceramic cylinders are butt sealed to metal washer 
plates. Such stacked structures are of increasing importance in the fabrica- 
tion of encapsulations for electron tubes and semiconductor devices. 

Practical experience, rather than analysis, has shown the advantage of a 
balanced seal and zt is now in common usage in the electronic industry. In 
some applications other requirements may prevent direct back-to-back balanc- 
ing and the question arises as to how much the diameters of the opposing 
seals may differ and still give the desired balanced effect. In Sections 3.1, 
4.1, and 5.1 this question 1s considered and the concept of a balanced seal 
is placed on a firmer basis. The results indicate that a small difference in 
the radii ts to be preferred. 

The effect of the length of an intermediate cylinder on the stresses in a 
three-cylinder stacked structure is considered in Sections 3.2, 4.2, and 6.2. 


I. INTRODUCTION 


In the electronic industry ceramic-to-metal seals are of increasing im- 
portance in the fabrication of encapsulations for electron tubes and semi- 
conductor devices. The most common method for sealing ceramics to 
metals is to first ‘‘metallize’ the ceramic by sintering onto it a thin coat- 
ing of metallic powder; common brazing materials such as gold-copper 
alloys, silver-copper alloys, etc. are then used to fasten the metallized 
ceramic to the metal. 

Temperatures in excess of 500°C are required for the brazing operation. 
As the assembly cools from the set-point of the brazing material to room 
temperature, thermoelastic stresses are induced in the bonded com- 
ponents by the differences in their thermal contraction rates. These 
stresses must be controlled if physical distortion and the resultant possi- 
bility of bond failure or fracture of the metal or ceramic are to be avoided. 
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Their magnitude depends on the choice of metal, ceramic, and brazing 
material as well as the applied heat treatment. If it is assumed that these 
parameters are held fixed, the magnitude of the thermoelastic stresses 
will still vary widely as a function of the structure geometry. 

This paper presents the results of a two-part analytical study of the 
stresses produced when ceramic cylinders are butt sealed to metal washer 
plates. The problems considered are represented by the electron tube 
envelope assembly depicted in Vig. 1. At position A the top ceramic 
cylinder is used to balance the effect of the ceramic-metal seal on the 
other side; it serves no other purpose. Practical experience, rather than 
analysis, has shown the advantages of a balanced seal and it is now in 
common usage in the industry. In some applications other requirements 
may prevent direct back-to-back balancing (as at position C) and the 
question arises as to how much the diameters of the opposing seals may 
differ and still give the balanced effect. In the first part of the paper this 
question is considered and the concept of a balanced seal is placed on a 
firmer basis. 

At position B the question arises as to the effect that the length of the 
middle cylinder has on the stresses produced. This problem is treated 








Ree | CERAMIC 
(SSSSSS) METAL 


Fig. 1 — A cross-sectional view of a representative ceramic-metal electron 
tube envelope. 
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in Sections 3.2, 4.2, and 5.2. A similar study of the effects of the inter- 
mediate lengths in a graded cylindrical butt seal has been reported.!~4 
Another related problem is a long pipe reinforced by rings and submitted 
to the action of uniform internal pressure.> The thread that holds to- 
gether these problems and the problem considered in Sections 3.1, 4.1, 
and 5.1 is that the stresses are produced by bonding together parts that 
would ordinarily be discontinuous at the joints under the given condi- 
tions. The continuity is produced by distributed forces and moments 
acting on each component along the joint. 


II. ASSUMPTIONS AND NOTATIONS 


Discontinuity stresses are highly localized; they are attenuated rapidly 
as the distance from the joint increases. Certain of the dimensions of the 
structure may therefore be idealized as infinite in extent without signifi- 
cantly affecting the stress distribution. The calculations are thereby 
greatly simplified. The idealized configurations for the two parts of the 
study are shown in Fig. 2. 

The metal and ceramic bodies are imagined to be sealed together at a 
temperature 7), the temperature then lowered to room value, 7’. It 
is assumed that no external restraining forces act on the structure dur- 
ing the sealing and cooling. The distributed forces, Q;, and bending 





Fig. 2 — Cross-sectional view of the structures idealized to study: (a) the ef- 
fects of a balanced seal; (b) the effect of varying l. 
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moments, M,;, required to maintain continuity at the joints of the 
ceramic-metal seals are depicted in the exploded diagrams of Fig. 3. The 
positive direction for the deflections and displacements are also shown. 
To evaluate the edge loadings it is expedient to refer to the literature 
for the appropriate expressions for the deflections, displacements and 
slopes. This will be done freely. The formulas are based on the membrane 
theory of cylinders and plates, i.e., diameter to thickness ratio = 5. 
It is also assumed that the deflections are small enough that the radial 
loads on the plates have negligible bending effects. In using these ex- 
pressions the following notations will be employed: 
_ Eche: eee rs _ 
in 12(1 — v2) me 12(1 — vy?) se 
As: ie LE, h. 4 Li, h, 
“dep. © ~ app, 


S|5 


Dp 


(1) 
By A= Ea a ap) (T , To) 
where £ is the elastic modulus; v Poisson’s ratio, and the subscripts c 
and p indicate cylinder and plate, respectively. 

Once the edge loadings have been calculated the results will then be 
used to evaluate the stresses that they produce in the ceramic cylinders. 
The notations used for the stresses are illustrated in Fig. 4. In calculating 
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— Q, eee 
PLATE M My 
NO.1 SGnneR 
«Ceca ZZ = NO. ig Aa Me 
0 Q, | Ee, 
Noe eae Gi a= 
Emm hens 
Wp 
| 
Qs<f | 
CYLINDER H 
0.2 | 
4 
| 
] 
(b) 


Fig. 3 — Exploded views showing the notations used for the moments and de- 
flections. 
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the stress values the literature on cylindrical shell theory uses the fol- 
lowing standard notation: 


eo = e (cos Bx + sin Br) 6 =e * cos Br es 
= ¢ (cos Bx — sin Bx) ¢ = 6 sin Bx 
and 
eA cosh Bl + cos fl ea BI sinh 61 — sin £l 


sinh Bl + sin Bl sinh 61 + sin Bl 


cosh Bl — cos £1 
sinh Bl + sin Bl ° 


_ (8) 
x3(6l) = 


III. FORMULATION OF PROBLEMS 


The problem considered in the first part of this and the following two 
sections is essentially the effect of radial unbalance in butt seals. The 
second is concerned with an axial unbalance resulting from the finite 
length of the intermediate cylinder. At this point it seems desirable to 
treat the problems separately. 


3.1 Radial Unbalance 
Static equilibrium at the joint r = b, Fig. 3(a), requires that 
Q2 = Q1+ Qs M, = M, — M; (4) 


Ox, «Te 





Fig. 4 — Cylindrical element showing stress directions. 
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and the quantities to be determined are reduced to six: Qo, Q:, Q3, Mo, 
M,, M;. These will be evaluated by using the boundary conditions that 
at the juncture of the cylinders and plate the radial displacements are 
the same and the plate remains perpendicular to the cylinders. The fol- 
lowing six conditions are obtained: 

(a) The radial displacement of cylinder #1 and plate 1 are equal 

along the junction at r = a. 
(b) The radial displacement of the two plates is the same at r = b. 
(c) The radial displacement of cylinder *2 and plate *2 are equal 
along the junction at r = 6. 

(d) Cylinder ¥1 and plate *1 remain at right angles at their junc- 

ture. 

(e) The slope of the two plates is the same at r = b. 

(f) Cylinder *2 and plate #2 remain at right angles at their junc- 

ture. 

The radial displacement of each Sart consists of a term due to the 
temperature change and a term due to the constraint forces resulting 
from the temperature change. Using the standard notations for displace- 
ments as depicted in Fig. 3(a), the first three conditions may be ex- 
pressed symbolically as follows: 


— (We1) c=0 + aa-(T can To) Cape + aa,(T —< To) 
(1) r= +f ba,(T’ == To) (U2) rao oF bap( T a To) (5) 
— (We2) 2-0 + ba.(T ~ To) = (Uz) rb + ba,(T a7 To). 


From the sketches in Fig. 5(a) it may be seen that the last three condi- 
tions yield the following relationships: 


ye. AWe1 = (“es tice (2s me (Se 
Ya = dz a, dr #26 Yo dr r=b dr r=b 
a= dx z=0 dr r=b ; 


It is now required to express the deflections and their derivatives ap- 
pearing in (5) and (6) in terms of the unknown quantities Q., Q., 
, M;. From Timoshenko (see Ref. 5, p. 393) it is found that 











(6) 





[iMo+ Qo] (wee)eno = ~ 585 a, [B2Ms + Qs] 


dWe1 = 1 dw.2 1 
( dx Mu ~ 282D,. [26:Mo + Qo] ( ae i _) 262D, [28.M3 + Qs]. 


(We1)2=0 aa ~ 2B; Sip, e 
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Fig. 5 — Exaggerated picture of deflections produced by discontinuity forces 
and moments acting at joints. 


The forces Q) and Q; , acting on plate #1 produce radial displacement 
and normal loading given by® 


m= Ar + Ban = phy] B a 


= 5 Sie Vp? 
Here the constants A and B may be determined from the conditions 
that Qp: attains the value Q and Q; at r = a andr = b respectively. 


If this is done and the values are substituted into the above expression 
for Up: , one finds 


a 
~ Eyhy(1 — a?/d*) 


b 
Eyh,(1 — a?/b?) 


(U1) ra {2Q, — Ql(1 — vp)a°/b? + (1 + vp)]} 
(8) 


{Qi(1 — vp) + (1 + vp) a/b") — 2Qo0°/b7} 





(U1) r=» = 


Ifa—-b,b— ~,Q— Q , Q: — 0, then the first of these gives 


- ba + Vp) _ bl + Vp) 
(te) p= = — Eh, Q2 = Eh, (Qi =: Q3). (9) 


The bending moments along the edges of plate #1 produce slopes 
(see Ref. 5, p. 63) 
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(“u=) 7 a 
dr Jr=a  Dy(1 — a2/b?) 
a/b 1 | _ | 1 1 } 
{u[ "+4. ane eg. ee 
(“v= _ b 
dr_ r=b - D,(1 a a*/b?) 


( a/v a’/b | 1 a2/b? 
{an| + | - My rt ee}. 


If we let a— b, b > ~, My — M2, M, — 0, then the first of these yields 











(10) 














‘int bM» b 

= (MV, — MD) . 11 
(“ rb Dp (1 — Pp) ian : s) (11) 
The required six equations relating Qo, --- , 1/3 may now be obtained 


by substituting expressions (7) through (11) into conditions (5) and 
(6). The resulting equations are simplified and only negligible error 
introduced if one sets 














Vp = Ve = 3. 
Then 
3 3 
Dee oe _. ee Sole » ps iat ae = 7 
and the six equations reduce to 
| “38 R 2+ one. R ve ve Mo 
Bih2  3h,1 — a2/b? hy(1 — a?/b?) 3 A : 


a eee ato?) & — 











RQ V V8/3Ms_ _, 
seatset pe |S + hg a? 
2 Qo : 4 3Rhe ae) 9Rhe mM, =o (12) 
oa apy h, 1 — a/b? ~ Ahp( — a/b?) A 
a’ My a = 
ee He aCe a’/b’) = 0 


1 Qs; 3Rhe My Sethe Ms _ 
baz Ah? mt ligt ne |a >” 








THERMOELASTIC STRESSES 1527 


3.2 Axtal Unbalance 
Static equilibrium at the joint r = a, lig. 3(b), requires that 
1 ae Qo + Qs M, = Mo — M, (13) 


and only the quantities Qo , Q2, M/y, M72 remain to be determined. The 
four conditions to be satisfied are: 
(a) The radial displacement of cylinder *1 and the plate are equal 
along the junction at r = a. 
(b) The radial displacement of cylinder *2 and the plate are equal 
atr =a. 
(c) Cylinder #1 and the plate remain at right angles along the junc- 
tion. 
(d) Cylinder *2 and the plate remain at right angles along the junc- 
tion. 
If we use the notations as defined in Fig. 3(b) the symbolic form of 
conditions (a) and (b) become 


— (We1) 20 + aa.(T r= To) = (thy eae + aay( T oars To) 
— (Wee) 2=0 + aa(T <= To) = yp ae + aa,( T = Ty). 


T’rom the sketch shown in Fig. 5(b) we see that the conditions (c) 
and (d) yield the relations 


_ _ (dwer _ {dwy, _ (dw. = ‘o») 
(“ i = >) .. eo (‘e) 7 e& rat On) 


The deflections and their derivatives may be written in terms of the 
unknown quantities Qo, Qo, Mo, Ms: as follows (see Ref. 5, p. 403): 


(14) 








2 
(wer) = — FE [Qax(B1) + BMoxs( 61) 
d 2° 2 oe 
We 
( oe) = re [Qoxa(Bl) + 26Moxs(6l)], 


where the notations in (3) are used. 
From equations (9) and (11) it is found that in the infinite plate the 
following relations hold 
(uw), = 2 +») _a(1 + 9) 


a Bhp 1 = Eh, (Qo a Q2) 


dw, = a = a 7 
(f) =p rt Me 


(17) 
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Tinally, the cylinder distortions are given by (see Ref. 5, p. 393) 


(Wee) 20 = a [eM + Qe] 


dWe2 
(Mt) = sy Bema + Oa 


The required four equations to determine the unknowns Qo , Q2 , Mo, 
MM, may now be found by substituting expressions (16), (17) and (18) 
into conditions (14) and (15). If we again set »» = ». = 3, then 


(18) 





8 


4 — — ie: 
pS 3a2h2 


and the four equations reduce to 


| 22 aucaty + 2H | Se 4. FR 4 V8 ala) ME = 4 











EAs fa 2 


2R Qo ATE Qe V8/3M2_ _, 
he Oo 


3h, ae T Bh2 ? 


Mo 


A (19) 


V/8/3 hexe(Bl) = Qo +|2V8R h-Bxs(6l) + 8R mee 


rs 


— 8k 


3M, 
iz =0 


0. 





ia 
ee Qo h Mo h. ee 
Wee a Moy | A+ ort 3 | a 

IV. SOLUTION OF PROBLEMS 


Once the edge loadings are known, the unit stresses in the ceramic 
cylinders may be calculated. Shell theory assumes that the normal 
stresses, oz and og, vary linearly through the thickness of the shell from 
a maximum value at one surface to a minimum value at the other sur- 
face. The shearing stress, 7 = 72, , is assumed to vary across the thick- 
ness of the shell according to a parabolic law with a maximum at the 
center. The variation of these limiting values as a function of distance 
from the joint is given by (see Ref. 5, pp. 45 and 88) 
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6 
(oe pean — 72 M, 
Ei, 
(eg )eisie =. (0 me: ae “ana (20) 
me) 
Tmax 9 he 5] 


where Q,, M, and wz are the force, moment, and deflection in the 
cylinder at a distance x from the joint. 


4.1 Radial Unbalance 


Of interest here is the magnitude and distribution of the stresses pro- 
duced in the cylinders as the radius of cylinder #2 is increased (Fig. 
3a). The cylinders are infinitely long; the expressions for the required 
quantities are (see Ref. 5, p. 393) 

1 
Mei = 3, [8:Moe (Bix) + Qot(Bix)] 
- MoW(B.) ) on 
Wei = ~ 2B, os. (8; ob (Bix 2 Qo9( Bix 
Qe: = —26;Mot (Bix) + Qy(Bix), 


for 7 = 1, 2. Here x is the axial displacement from the cylinder-plate 
joint. 


4.2 Axial Unbalance 


The effect that varying the length of the middle cylinder has on the 
stresses in both cylinders *1 and xX 2 is of interest (see Fig. 3b). Cylin- 
der 1 is infinitely long and, corresponding to (21), the required ex- 
pressions are 


Ma = =[8M2e(6r) + Q2f (Bx )| 


: 
B 
(22) 


[BMiy(Bx) + Q26(8x)] 


Wel 


— ap 
Qu = —26M£(Br) + Q(Ba). 


Cylinder * 2 is finite in length and the expressions for the required quan- 
tities are 
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M2 =5 [A cos BE cosh BE — Q sin BE sinh Bé] 
Wea = —Te - [A sin B& sinh BE + Q cos BE cosh BE] 
Qi. = —2(A + Q) sin BE cosh BE + 2(A — Q) cos BE sinh BE 


where & = x + 1/2 is the axial displacement measured from the center 
of the short cylinder and 


Qo sin i sinh 5 pl + BMo (sin ee cosh 5 Bl + cos 7 Venn zt) 


mie sinh Bl + sin 


Qo cos ee cosh 5 Bl + BM (cos 8 Be sinh Z — sin ne cosh a 


We sinh Bl + sin Bl 
These results are obtained by superposing the solution for the case of 
bending by uniformly distributed shearing forces Qo and the solution for 
the case of bending by moments Mp distributed at the ends (see Ref. 5, 
p. 403). 


V. STRESS DISTRIBUTIONS AND MAGNITUDES 


Calculations were carried out on an electronic computer to demon- 
strate the effects of the two types of unbalance. It was decided to use 
the value R = 2.1 for the ratio of elastic moduli corresponding to the 
Diamonite ceramic cylinders and the Kovar plates employed in the 
particular tube structure illustrated in Fig. 1. Here A is a positive quan- 
tity since ap > a, and JT’) > T. The dimensions of the tube were also 
chosen to correspond to this practical geometry and the plate thickness 
was varied. 


ho = 0.04in., a@=0.22in., hp» = 0.01, 0.02, 0.04 in. 


Of interest are the stress distributions in the cylinders, especially the 
values attained at the seal junctions. These have been obtained for both 
radial and axial unbalanced seals for the three plate thicknesses. 


5.1 Radial Unbalance 


The stresses in the two cylinders at the cylinder-plate junctions are 
depicted in Fig. 6. The first graph shows the longitudinal stress at the 
outer wall of each cylinder; an equal tensile stress occurs at the inner 


- h 
(z)x =o, z=-- 
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0.6 


y hp= 0.04 IN. 
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RADIUS a 
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0.2 Np = 0.02 IN. 


hp= 0.04 IN. 
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Fig. 6 — Stresses at the junctions of radially unbalanced seals. The values for both cylinders are shown 
as the radius, b, of the balancing seal is varied. 
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wall. The circumferential stresses are compressive all across the wall 
thickness; the maximum numerical values are attained at the outer wall 
and these are shown in the second graph of ig. 6. The maximum shear 
stress acting on the plane of the cylinder-plate junctions is shown in the 
third graph. 

It is seen that as the amount of the unbalance is increased, i.e., b is 
increased, the junction stresses in the smaller cylinder all decrease. In the 
larger cylinder the longitudinal stress at the junction decreases, but the 
circumferential and shear stresses increase. This trend actually continues 
until the radius of the larger cylinder is several times that of the smaller. 

The variation of the stresses as a function of distance from the junc- 
tion is shown in Irig. 7. It is of special interest to compare the results for 
completely balanced seal, Fig. 7(a), with those of a completely unbal- 
anced seal, Fig. 7(c). 


5.2 Axial Unbalance 


Figure 8 shows that as the amount of axial unbalance is decreased, i.e., 
lis increased, the junction stresses in the long cylinder decrease and then 
increase again to a limiting value. The stresses in the short cylinder are 
practically the same as those in the long cylinder with the exception of 
the shearing stresses. The shearing stress in the short cylinder increases 
with decreasing unbalance to a limiting value. From the standpoint of 
both cylinders the optimum length for the intermediate or short cylinder 
appears to be about 0.125 in. The somewhat larger stresses correspond- 
ing to the perfectly balanced configuration are attained for / greater than 
approximately 0.3 in. 

If J is allowed to tend to zero the configuration reduces to a perfectly 
balanced seal with a plate of twice the original thickness. The stresses 
for a given plate thickness at 1 = 0 must therefore equal the stresses 
produced in a structure using plates of twice the thickness at very large 
values of 1, say 1 = 0.4. Figure 8 shows this very well. 

Figure 9 illustrates the variations in the stress distributions for several 
lengths of the intermediate cylinder. 


VI. DISCUSSION 


Practical experience has shown the balanced seal to be stronger than 
the completely unbalanced seal (7) under routine handling conditions, 
and (zz) under thermal shock in subsequent welding operations. In the 
former case the unbalanced seal is more susceptible to a failure of the 
bond between the cylinder and plate; in the latter case the weakness is 
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a= 0.22 IN. 
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Fig. 7 — Stress distributions showing the effects of various conditions of radial 


unbalance; (a) completely balanced, (b) slightly unbalanced, (c) completely un- 
balanced seals. 
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Fig. 8 — Stresses at the junctions of axially unbalanced seals. The values for both cylinders are shown as the length, I, 
of the intermedate cylinder is varied. 
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Fig. 9 — Stress distributions showing the effect of varying the length, J, of the 
intermediate cylinder in an axially unbalanced seal. Note the difference in scale 
for long (right side) and short (left side) cylinders. 
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manifested by longitudinal cracks in the unbalanced ceramic cylinder. 
It is of interest to interpret these facts in the light of the calculated 
stresses presented above. 

Since type (2) failures clearly cannot result from compressive circum- 
ferential stresses and the longitudinal junction stress is essentially zero 
for the unbalanced seal, Fig. 7(c), the culprit must be the shear stress 
at the junction, i.e., the metallized interface fails in shear. Comparison 
of rmax for the two structures, Fig. 7(a) with Fig. 7(c), shows that indeed 
its value is about 80 per cent greater for the unbalanced seal than for 
the perfectly balanced seal. 

Type (2) failures occur when the metal plate is raised to a higher tem- 
perature than the ceramic cylinder by uneven heating. This introduces 
tensile circumferential stress in the cylinder which results in longitudinal 
cracks if the temperature difference is great enough for the residual com- 
pressive stress to be sufficiently exceeded. A first approximation to the 
relative values of these transient stresses is obtained by merely reversing 
the sign of A in the calculations presented. Comparison of Fig. 7(a) and 
Tig. 7(c) then shows that a much larger circumferential tensile stress 
would be expected in the unbalanced seal under such a thermal gradient. 

If these explanations are valid it may be concluded from Fig. 6 that 
the balancing cylinder (radius b) is even more effective for b > a. For, 
the magnitudes of the shear and circumferential stresses in the smaller 
(and more crucial) cylinder decrease as b is increased. The unbalance 
cannot be made too great, however, as the corresponding stresses in the 
larger cylinder increase. 

In the case of axial unbalance it is clear (Iig. 8) that the intermediate 
cylinder must have length / > 0.125 if the presence of seal at the opposite 
end is to be negligible. If this inequality is satisfied the susceptibility to 
failure of either type (2) or type (22) is greatly reduced. 
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The Tunnel Diode as a Linear 
Network Element 


By I. W. SANDBERG 
(Manuscript received April 3, 1962) 


Theorems are proved which completely characterize in an explicit manner 
the class of immittance matrices realizable with lossless reciprocal elements 
and a tunnel diode represented by the three-parameter “LC ,—R’’ model. 
Techniques are presented for the synthesis of any immittance matrix within 
this class. 

Considered first, from a scattering matrix viewpoint, are the so-called 
degenerate cases in which the immittance matrices of the lossless network 
do not exist. Throughout the remainder of the discussion it is assumed that 
the lossless network possesses an immittance matrix. Necessary and suf- 
ficient conditions, involving in a complicated manner the existence of a 
certain strict Hurwitz polynomial, are derived for realization with a wide 
class of terminations. A study of the existence of this polynomial for the 
particular terminations of interest leads to explicit realizability conditions. 


I. INTRODUCTION 


The small signal ‘“C , —R”’ model of the tunnel diode (Fig. 1) provides 
a fairly good representation over a wide range of frequencies, and is 
much simpler to use in a general study of network properties than the 
“LC,—R” model (Fig. 2) which includes, in addition, the series in- 
ductor. The simpler model has been used extensively by network theo- 
rists.1-10 

The primary purpose of this paper is to define in an explicit manner 
the class of m X n open-circuit impedance and short-circuit admittance 
matrices that are realizable with lossless reciprocal elements and a 
tunnel diode characterized by the ‘“LC, — R” model. The results con- 
stitute an extension of the theory presented in Ref. 10* for the “C, — 
R” model. The main interest in this problem to date relates to the 


* Although the present paper is self-contained, some familiarity with the earlier 
work would be of assistance to the reader. 
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c 


-R 


Fig. 1 — ““C, —R’’ model of the tunnel diode. 


special case n = 2. From a practical viewpoint our results anticipate 
the development of approximation techniques that lead to the specifica- 
tion of lossless-network tunnel-diode substructures which are to operate 
between prescribed sources and loads. Indeed an objective of this paper 
is to encourage research in this direction by presenting a complete solu- 
tion to the realization problem. 

The basic structure under consideration is shown in Tig. 3 in which 
the (n + 1)-port network is assumed to be a lossless reciprocal configura- 
tion containing inductors, capacitors, and ideal transformers. While we 
shall be particularly concerned with the case in which port (7 + 1) is 
terminated with the “LC, — R” model of the tunnel diode, many of 
the arguments to be presented are applicable to a much wider class of 
terminations. The overall network is assumed to possess either a short- 
circuit admittance matrix Y(s) or an open-circuit impedance matrix 
Z(s) relating currents and voltages at the ports (1, 2,--- , 7). 

The realizability study is initiated in the following section where we 
discuss the cases in which the immittance matrices of the lossless net- 
work fail to exist. Throughout the remainder of the paper we consider 
the realizability of Z(s) and assume that the (x + 1)-port lossless 
network possesses an open-circuit impedance matrix Z(s). This involves 
no loss of generality, of course, since results for the short-circuit admit- 
tance matrix Y(s) are identical with those for the open-circuit impedance 
matrix with the termination replaced with its reciprocal. In Section III 
necessary and sufficient conditions are presented, in terms of an unknown 
strict Hurwitz polynomial, for the realization of Z(s) with a wide class 
of terminations. The following sections utilize these results to obtain 
explicit realizability conditions for the particular termination of interest. 


c 


—R 


Fig. 2— “LC, —R” model of the tunnel diode. 
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Fig. 3 — Most general structure defining Z(s) and Y(s). 


In an interesting recent paper, Schoeffler considers a problem similar 
to that discussed in Section III for the special case n = 1 under an 
assumption equivalent to supposing that the unknown polynomial is 
unity. In order to obtain explicit conditions, he further assumes that 
both Z(s) and the termination are regular at infinity and that Z(s) has 
no singularities on the entire jw-axis. Of course, for our purposes, these 
assumptions cannot be made. Indeed, for the particular problem con- 
sidered here, the most interesting realizability conditions arise from a 
possible pole at infinity of the termination and from the influence of 
the unknown strict Hurwitz polynomial. 


Il, REALIZABILITY CONDITIONS WHEN THE IMMITTANCE MATRIX OF THE 
LOSSLESS NETWORK DOES NOT EXIST 


The (n + 1)-port lossless network in Fig. 3 invariably possesses a 
symmetric regular para-unitary scattering matrix” which we shall denote 
by S(s). However, the corresponding short-circuit admittance matrix 
Y(s) exists if and only if* det{1,.: + S(s)] does not vanish identically 
in s. Similarly Z(s) exists if and only if det{1,41 — S(s)] does not vanish 
identically in s. In this section the following theorem is proved which 
completely characterizes Y or Z in the event that ¥ or Z fails to exist. 


Theorem 1: If Y {Z] in Fig. 3 exists with port (n + 1) terminated with an 
admittance y [impedance 2] but Y [Z] does not exist, Y = yC + Y’ [Z = 


* The identity matrix of order (n + 1) is denoted by lay . 
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2zC + Z’] where C ts a nonnegative definite symmetric real matrix of con- 
stants of rank not exceeding unity and Y’ [Z’| ts the short-circuit admit- 
tance matrix [open-circuit impedance matrix] of a lossless reciprocal net- 
work, 

The proof is based on the following lemma which is adapted from a 
result of Youla et al.” 


Lemma 1: Let 8(s) be a regular symmetric para-unitary scattering matria 
of order (n + 1) such that the normal ranks of [1n41 + S(s)] and [liga — 
S(s)] are r and r’ respectively. Then there exist two orthogonal constant 
matrices T and T’ such that* 


tA —1Lisi, 0 
T S(s) T= 6 (1) 

0 S, 

MA ! Lisi 0 
T’ S(s) T = me | (2) 

0 5, 


where 8, and 8,» are symmetric regular para-unitary scattering matrices 
of orders r and r’ respectively. Moreover det{1, + S,] and det{1, — §,] 
do not vanish identically in s. 

Suppose now that Y(s) in Fig. 3 exists but that Y does not exist. 
Then the normal rank of [1n41 + S(s)] isr < (n + 1). Equation (1) 
can be interpreted as a realization of S(s) in terms of a (2n + 2)-port 
ideal transformer network, n + 1 — r short-circuits, and a reactance 
r-port possessing a short-circuit admittance matrix Y, = [1, + SJ[L — 
S,], as shown in Fig. 4. 

Since E, = T‘E,, where E, = [en, €2,°°*, €mnzyn] and E, = 
[€a1 , a2, °** » Ca(n+1)|, the number of independent linear relations among 
the components of E, is equal to the number of zero components of 
E, . However, since Y exists but Y does not exist, this number is equal 
to unity (r = n), and the resulting single linear constraint is 

n+1 


2d tj1€aj = 0, Un41)1 F 0 (3) 
j= 


in which the ¢;; are the elements in the first column of T. As a conse- 
quence, it is a simple matter to show that we may construct an (n + 
1) X n matrix of real constants A such that 


E, = AE, 4) 
Al, = 1, 


* The superscript ¢ denotes matrix transposition. 
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Fig. 4 — Realization of S(s) when the normal rank of [1py. -+ S(s)] is r. 
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where 
Es’ = [eacntay » C02 35 °°" Cinta] 
LY = [aay y tee, toe, ++ Tong 
Eu’ = [ea, @a2,*** > Can 
Ln! = [lar , tan, +++, Zan. 
But 
ae ' : |p (5) 
0 Y, 
which, together with (4), yields 
i. =a? » [AB (6) 
0 Y, 
Thus, 


0 Y, (7) 
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where B‘ is the n-vector of elements in the first column of A‘ and D is 
the matrix of elements in the last n rows and columns of A. 

A very similar argument suffices to establish the result for the case 
in which Z fails to exist. 


III, REALIZABILITY OF Z(S) WHEN Z EXISTS 


Throughout the remainder of the paper we consider specifically the 
realizability of Z(s) under the assumption that Z(s) exists. As men- 
tioned earlier, this is equivalent to assuming that Y exists and that the 
termination is replaced with its reciprocal in order to study the prop- 
erties of Y(s). 

We shall suppose that the impedance terminating port (7 + 1) is the 
positive-real function z(s) = ab’, where a and b are Hurwitz poly- 
nomials. Of course the impedance of the LC, — R model is not a posi- 
tive-real function. However, it is convenient to replace the negative 
resistor with a positive one of equal magnitude so that we may state 
that Z(s) is a positive-real matrix, with the understanding that Z(s) = 
—Z(—s) where Z(s) is the impedance matrix of the n-port with the 
resistor negative.’® Further, it is sufficient to assume that the LC, + R 
termination comprises an inductor of value a Henries in series with a 
parallel combination of a unit resistor and unit capacitor, for any other 
values can be accommodated by impedance and frequency scaling. Thus, 
we shall be particularly interested in the results for z(s) = (as” + 
as + 1) (s + 1)’, (a = 0). However in this section we shall merely 
require that* 


aed, — Aobdp, (8) 


Ze = b.2 = b,2 > 


the even part of z, have no zeros on the finite jw-axis and that b(s) is a 
strict Hurwitz polynomial. 

It is well known that 
1 


a as eT 
Z il Luk 7 


(9) 


where the submatrices in (9) are defined by the following partition of 
Z(s): 


n i 
os Zi Zi [n 
Z(s) = | ; . (10) 
Zin Zo} 1 


* Throughout the paper we shall denote by the subscripts o and e respectively 
the odd and even parts of polynomials or matrices. Thus, for example, A, = 
2[A(s) + A(—s)], Ao = 3[A(s) — A(—s)]. 
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The arguments to be presented center about a study of Z., the even 
part of the matrix Z. This matrix is given by 


1 
Tae eee 


It is convenient to introduce the notation Z2 = d nx, Ze = d ‘Ny 
where d is an even polynomial, nz. is an odd polynomial and Ny is an 
n-vector of odd polynomials, with the understanding that d, ne, and 
every element of Nj. may have a common simple zero at the origin.* 
In this way it is unnecessary to treat separately the cases in which d is 
even or d is odd. 

Accordingly, 


Ze = — (acbe <r obo) NizNi2' 


Ze = — 3[z(s) a 2(—8)]Zy2Z19' 


1 (12) 
[bnx» + ad][b(—s)nx2(—s) + a(—s)d(—s)]” 

Note that the assumptions regarding z, m2. , and d require that the poly- 
nomial [bre + ad] be strictly Hurwitz except possibly for a simple 
zero at the origin. Also, as one would expect,” the zeros of [bro + ad] 
cannot coincide with any of those of (a.b. — a.b.). This follows from 
the fact that the existence of a nontrivial solution for a and b satisfying 
bn + ad = 0 and (a.b. — ab.) = $[ab(— s) + ba(— s)] = Oat some 
point s = s; requires that s; satisfy b(— s,)N2(— s1) + a(— s)d(— 
s:) = 0, which contradicts the fact that the zeros of bro: + ad are re- 
stricted to a half-plane. 

It is convenient to state the following 

Definition: The matrix Z, 1s said to be in standard form if and only if 


1 
Z. = _ De — Do UU’ ———_— 
ee ae aes) 
where v(s) ts a positive coefficient polynomial which is strictly Hurwitz 
except possibly for a simple zero at the origin and U' = [uy , U2, +++ , Unl 


is a row matrix of odd real polynomials with the property that there ts no 
factor n(s)n(— s) common to all the u; such that °(s) divides v(s) where 
n(s) ts a strict Hurwite polynomial. Further, v(s) and (acbe — dob.) are 
relatively prime. 

In Section 3.1 the following result is proved.t 


Theorem 2: Denote by z(s) the two-terminal posttive-real impedance z(s) = 
ab’, with b a strict Hurwitz polynomial and z. having no zeros on the 


* With this exception, no: and d are assumed to be relatively prime. 
{ We shall use the notation lim [-] = [-], throughout. 
sa 
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finite ja-axis. Then the rational positive-real open-circuit impedance matrix 
Z(s) ts realizable as shown in Fig. 3, with the understanding that the lossless 
reciprocal network possesses an open-circuit impedance matrix Z(s), if 
and only if Z, can be expressed in standard form and there exists a strict 
Hurwitz polynomial (s) defining 


X ='+Un(s)n(— 8s),  w = v7'°(s) 
such that 


(1) (Wee — Welle) (Wobo — Webe) is a reactance function, the de- 
generate case in which (wobo — webe) = 0 not permitted. 
(ii) |x | exists, and 


We Qo = Woe 


(itt) E z | — |Z. fie | ts nonnegative definite when the 


S(AcWe — AcWo) leo 
reactance function in (t) has a pole at infinity. 


Further, any Z(s) satisfying these conditions can be realized as shown in 
Fig. 3 with Z(s) given by 


t b(acbo —_ abe) X (agbo = abe) 


3 ORE a baa ne SH 
8 = 
x! (aobo = ede) (AoW. —_ AW) 
(bowWo — b.We) (boWo — bw.) 


3.1 Proof of Theorem 2 


It is evident that a prescribed Z,. can be expressed in standard form 
if Z is realizable.* Suppose that Z and Z, in standard form are given 
and consider the problem of determining d, nx» , Nis, and Zy, . In partic- 
ular, let us consider identifying d, nx , and Ni2 by equating the standard 
form expression for Z, with the right-hand side of (12). A common 
factor may have been cancelled in the expression for Z, and hence an 
unknown factor must be reinserted before we can proceed. However, the 
unknown factor must be of the form f’(s) = g(s)g(—s) where g(s) isa 
strict Hurwitz polynomial. Therefore f(s) = n(s)n(—s) where n(s) isa 
strict Hurwitz polynomial. It follows that the most general expression 
for Z, of the form: 


1 


Ze= — (abe = obo) XX' w(s)w(—s) 


(13) 


* The problem of factoring Z, into this form is discussed elsewhere.!° 
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in which 
1. w(s) is a positive coefficient polynomial which is strictly Hurwitz 
except possibly for a simple zero at the origin, and 
2. X‘ is a row matrix of real odd polynomials 
is expressible in terms of the standard form expression with 


X = + Un(s)n(—s) (14) 
w = vn (s) (15) 

where 7(s) is an arbitrary real strict Hurwitz polynomial. 
Thus if Z is realizable, X = Ny and w = bn» + ad for some X and w 
generated by (14) and (15) with m2 and d respectively odd and even 
polynomials that are relatively prime, except possibly for a common 


simple zero at the origin, such that nad’ is a reactance function. Equat- 
ing the even and odd parts of w and big + ad yields: 


d (b,We —bewWe) (Voto —DeAe) (16) 


fag = (AsWe —AdWo) (Dodo — bee) (17) 


Suppose now that fxd’ is a reactance function. Then the two fune- 
tions: 


w(aobo a ade) 





Noe = 

a ee 
and 

d —1 _ wagb, pags a,b.) 

Rog ~ A(AgWe — AcWo) 1?) 


are required to be positive-real. Since the even polynomial (a,b, —a.b-) 
is either a constant or has zeros in the right-half plane, it is evident from 
(18) and (19) that (16) and (17) are polynomials. Furthermore in 
view of the positive-real property of (18) and (19) and the fact (16) 
and (17) are respectively even and odd polynomials, it follows that the 
zeros of (16) and (17) are restricted to the jw-axis and that these zeros 
are simple, except possibly for a double zero of d at the origin which 
can occur when v and hence w has a simple zero at the origin. With this 
single permissible exception, it is also the case that (16) and (17) are 
relatively prime, for the condition that there exists a nontrivial solution 
for w, and w, in 


boWo —b.We = 0 


—AWo + ApWe = 0 
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is (a,b, —a-be) = 0, which, by assumption, is not satisfied for pure 
imaginary values of s. Thus, if ‘ind is a reactance function, 
w = bn» + ad with my = fin and d = d where, as required, nx. and d 
are relatively prime polynomials except possibly for a common simple 
zero at the origin. 

Next consider the determination of the submatrices Zi. and Zy,. For 
d = d given by (16), 


( boo —_ bd.) 


— =k = 
Zi. = Xd xX (ita (20) 
Using (9), z(s) = ab, d = d, nx = fix , and (20), we find 
Zi = ZAK d(acbo = ede) (21) 


(boWoe = bw.) w 


By substituting Z = Z, + Z. in (21) with Z. given by (13), it is easy 
to show that (21) is a matrix of odd functions, as it should be. Further- 
more, since (bow, —b.we) = d(a.bo —ab-), it is evident from (21) that 
Zi is regular in the entire finite strict left-half plane and consequently 
has finite poles only on the jw-axis. 

Consider now the realizability of 





t b(aobo aod abe) (abo = ede) / 
a 4 - a (bowWo = b.We) Ww (bow aad b.w.) 
Z(s) = = (99) 
| x! (abo = ab.) (AWe = AWo) 
(bowo — bw.) (boWo — beWe) 


We require the following lemma.!° 


Lemma 2: The symmetric matrix of real constants 


m An Ar 
A = i ’ Ax» > 0 
Ai Ax 


partitioned as in (10) ts nonnegative definite «f and only if Au — 
AAAs is nonnegative definite. 
Let K; denote the residue matrix at a pole of Z(s) which arises from 


a zero of (b,w, —b.w.) at say s = jw; , and let the residue matrix of Z at 
that pole be K; . Then,* 


* When w = su = s[u.e + uo], where u is a strict Hurwitz polynomial, it is neces- 
sary to replace w, , wo. and X respectively with u., we and the n-vector of even 
polynomials s'X before this argument is applied to verify the nonnegative defi- 
niteness of the matrix of residues associated with the pole at the origin. 
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K,; ae Xx! b(asbo — aeb,) X (abo i Aede) 
qw q 


K; =] (23) 
x! (aabo —_ Aebe) ApWe — AeWo 


q i] s=ju,; 





where q is the derivative of (b,w, —b.w.) with respect to s. Since 
(4) (aw. —a-w,) evaluated at s = jw; is positive, K; is nonnegative 
definite if and only if 


t b(aabo — ede) 


K; + XX 2 
qw 





s=jo; 
ee (abo “ra debe)” y t q 
(q)? (AoWe a A.Wo) 


= K; + Xx! 1 | owe — a,b, )a(b.we 5 Pave) | 


q W(AcWe — AeWo 








(24) 





sS=jw; 








s=jw; 


is nonnegative definite, a condition which is clearly satisfied since 
(bw. — bow.) vanishes at s = ju;. 

Finally, we require that Z have at most a simple pole at infinity and 
that K,, = [(1/s)Z],, be nonnegative definite. It is clearly necessary 
that the limit 


[Zoo *Zisle = x {adhe = ae) | (25) 
(avWe pie AeWo) oe) 


exist. When (Zo: 'Zy],, does exist with Zo. a reactance function, it fol- 
lows from (9) that Zi has at most a simple pole at infinity, since Z is 
assumed to possess this property, and consequently that Z has at most 
a simple pole at infinity. 

Suppose now that Ze. has a pole at infinity. According to Lemma 2 
and (22), K, is nonnegative definite if and only if the following matrix 
of constants is nonnegative definite: 


t b(acbo — aebe) | - E Xx (ab, — ae) | 


(bow. — bew.)w s” (boWo — beWe) 
iE Xx! (aobo = oes) E (aoWe a all 
Ss (b.Wo a b.We) wo LS (b.Wo i b.We) fre) 
Some manipulation shows that (26) can be rewritten as simply 


Kok lz (eae (27) 


* s(A.We — AcWo) 


K, + E XX 
(26) 
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If Zo. does not have a pole at infinity, K,, is nonnegative definite if 
and only if Zi. does not have a pole at infinity and [(1/s)Zul],, is non- 
negative definite. The first of these requirements is contained in the 
condition that (25) exist, while the second follows from the assumed 
positive-real property of Z, for in this case [(1/s)Z]. = [(1/s)Zul,, . 

This proves Theorem 2. 


3.2 Remarks Concerning Theorem 2 


Consider first the conditions under which the degenerate situation 
(b.w. —b.w.) = 0 can arise. Assume that both b, and b, do not vanish 
identically in s. Note that a strict Hurwitz w cannot satisfy the equation, 
for b is assumed to be strictly Hurwitz and hence if w is also strictly 
Hurwitz b,w, vanishes at the origin while b.w, does not. The alternative 
possibility is that w = su where uw is a strict Hurwitz polynomial. In 
this event we have (ub. —uobe) = 0 and therefore* u = b. This leads 
to the following expression for Z, : 


1 
[s(be + bo) |[—s(be 7 bo) | 


Since each element in the matrix Z, must approach zero at least 
as fast as z., which is obvious from (11), the degree of each polynomial 
in the n-vector of odd polynomials X is at most unity. Thus X contains 
no non-jw-axis factors that can be cancelled and consequently (bow. — 
bewe) = O implies that 7’ is a constant, say unity, and w = s(be + bo) 
or, equivalently, (b.v. —b.v.) = 0. However given (bv, —b.v.) = 0, it is 
not clear that a nonconstant choice of 7° could not render d fz. real- 
izable as a reactance function. To resolve this question assume that w = 
n's(be +b.) and consider z + dfx, which must be a positive-real func- 
tion if dfx is realizable. Some algebra yields 


Ze = —(a-b. — Ab.) XX’ 


2 
ge dig Se, (28) 
2neNo 
It is clear that (28) is not a positive real function for any choice of 7 
when b, , b, # O because of the right-half plane poles of z, . Thus, 


Lemma 3: When b. , bo 4 0, condition (2) can be replaced with the statement: 
(Wee —WeAe) (Weds —Webe) is a reactance function and (bv. —bede) F 0. 

The discussion relating to the realization of Z when z = (s + 1) 
shows that the assumption b. , b, ¥ 0 is necessary. 


* We are ignoring a trivial constant multiplicative factor. 
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We wish to show now that condition (777) is invariably satisfied when 
2 is regular at infinity. This occurs because the limit 


E =e). 1 (29) 


* s(QoWe — AcWo) 


vanishes when z is regular at infinity. To prove this it is sufficient to 
consider the limit obtained by evaluating (29) with Z,. replaced with 
z.. Using w = biix + ad to compute w, and w, , we find 


E a(w. — Wp) | =i.|8 Qe — a) _ | 
* s(doWe — AW.) | 8 Lti2 \be — bo 


from which our assertion is obvious.* Thus, 











Lemma 4: Condition (217) ts satisfied when z(s) ts regular at infinity. 

In the following sections we shall use Theorem 2 and Lemmas 3 and 
4 to obtain explicit realizability conditions for z(s) or z‘(s) = (s + 1)™ 
(as’ + as + 1). We assume throughout that Z(s) is known to be a rational 
symmetric n X n positive-real matrix, and that Z, in standard form ts 
given byt 


1 
Z. = —UU' —__—_.. 
v(s)v(—s) 
To further avoid repetition, the term “realizable with an impedance z(s)”’ 
ts to be understood to refer to the realizability of the multiport matrix as a 
structure shown in Fig. 3 with the provision that Z(s) exists. It 1s convenient 
to treat separately the cases in which a = 0, anda > 0. 


(30) 


IV. EXPLICIT REALIZABILITY CONDITIONS WHEN 2 = (s + 1)7! AND Zz = 


(s + 1) 
When z(s) = (s + 1)~’, conditions (7) and (i) reduce to 


(4) wo(we —sw.) must be a reactance function and v, # sv, (see 
Lemma 3). 


(77) [X(1/w.)],, must exist. 
According to Lemma 4 condition (722) is satisfied. 


* This result can be established in a more direct fashion by observing that the 
nonnegative definiteness of K,, is implied by Lemma 2 and expression (9) when 
z(s) is regular at infinity in view of the positive-real property of Z(s). Neverthe- 
less, it is instructive to consider this matter from the viewpoint presented above. 

+ Here (a.b. — a.b.) = 1. 
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Condition (7) requires that [(1/s)(w./w.)], 2 1. Observe that w 
must be of even degree and that [(1/s) (w-/w.)],, is simply the reciprocal 
of the negative of the sum of the zeros of the polynomial w(s). It follows 
at once that (72) is satisfied for some 7(s) if and only if [(1/s)(v-/v.)]., 
= 1. Requirement (77) is satisfied without additional restrictions on 


Z(s), for (2) implies that 
=| ou!) —_ xy] es 
le = | — arama | = 8 aa 
since X is an n-vector of odd polynomials and Z, is bounded at infinity. 
This proves 
Theorem 3:* 


The matrix Z is realizable with an impedance (s + 1)” * if and only af 


Ve 
°; —_— > 
Ve ev, and eI ai 
4.1 2(s) = (s + 1) 
In this more interesting caset the three conditions become 
(i) (wo —swe) (we) must be a reactance function and w, ¥ 0. 


(it) [X (w, —swe)’],, must exist. 


(202) E z | - [2,2 49) (oe — a) | must be nonnegative definite 


S(SWe — Wo) | 
when (w. —sw.) (we) has a pole at infinity. 


From (7), k’ = [w./sw:],, 2 1. Since 


1 


Z, = — XX w(s) w(—s)’ 


and (7) requires that w(s) be of odd degree, (77) is satisfied for hk’ > 1. 
However if k’ = 1, (77) is satisfied if and only if [Z.].. = 0. In terms of 
k’, condition (777) is equivalent to the statement 


1 ki 
[iz] - pi. (32) 


§ 


must be nonnegative definite when k’ > 1. 


* This result was stated without proof in Ref. 10. 

+ This case together with the situation in which Z does not exist is treated in 
detail from a somewhat different viewpoint in Ref. 10. It is included here pri- 
marily to illustrate the application of Theorem 2. 
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Consider now the influence of the strict Hurwitz polynomial 7(s). 
Note first that Z. may be a matrix of constants; that is, v(s) may be 
equal to ys where y is a positive constant. Let 8 be the reciprocal of the 
negative of the sum of the zeros of 7(s). In this case k’ = 6 and 8 can 
be chosen arbitrarily large to minimize k’(k’ — 1)~’. Therefore when Z, 
is a matrix of constants, (7), (22), and (777) reduce to the requirement 
that 


[iz] - +o. 


be nonnegative definite for some* « > 0. 

When Z, is not a matrix of constants, the most favorable choice of 
n(s) is simply a constant, for then k’ is maximized and k’(k’ — 1)" is 
minimized. Thus, 


Theorem 4: The matrix Z is realizable with an impedance z = (s + 1) if 
and only if 


1. When Z, is a matrix of constants, [(1/s)Z|,, — (1 + e€)[Z-] is non- 
negative definite for some « > 0. 


2. If Z.is not a matrix of constants, k = [v./sv.|,, 2 1; ¢f k = 1, [Zelo = 
0;7fk > 1, [A/s)Z],, — [k/(k — 1)][Z-]., is nonnegative definite. 


V. EXPLICIT REALIZABILITY CONDITIONS WHEN 2(s) OR 2! (s) = (as? + 
as+1)(s+ 1) t,a>0 


In these cases, as will become clear, the polynomial n(s) plays a 
central role in determining the realizability conditions. We shall con- 
sider first the case: z(s) = (as” + as + 1)(s + 1). Here condition (2) 
requires that 


2 
: SWe — Wel as 1 Ww. 
(i) AsWe o(a + ) oe as + 0 
SWo — We We — SWo 





must be a reactance function and, using Lemma 3, sv, 4 ». 


It is clearly necessary that [w./sw.]. = 1. Thus we may assume that v 
and w are of even degree. Let 


v(s) = > vps” (33) 


* If the lossless network is not required to possess an open-circuit impedance 
matrix, e can be taken to be zero.!9 
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2p 
w(s) = a(s)o(s) = 2 ws! (34) 
where p = m + (degree of 7). Then, since the reactance function must 
have a nonnegative “residue”’ at infinity, . 


IV 
Rg 


Wop-1 (Wep—2 — Wap ) ee 


OF SINCE Wep_1 = Wey , 


IV 
cm) 


1 — a(w'ep_2 — W'op-s) (35) 


where wz’ = Wy(Wop) 
Condition (22) reads: 


[ as’ — (as + 1) | | —1 | 
m) |x aswe — (a8 + 1)wolen ares — SWo) — Wo_jeo 
must exist. 


Assume first that (35) holds with strict mequality in which case (772) 
becomes 


—1 
[x Ee | (36) 


must exist. But [X(1/s’”)]o = 0, since w is of even degree, and therefore 
the limit (36) does indeed exist. Suppose now that (35) holds with the 
equal sign. Then (27) requires that 





—1 
fe 
[x O(Wap4 — Wap-5) SP? — Ways =I, en 


exist.* Hence the degree of X must be (2p — 3) at most. Since the degree 
of w is 2p, (37) will exist if and only if 


[s’"Zel, = 0 (38) 
Consider now condition (722) which requires that 
E Z| + [8Z.]., —_ (39) 
Shs 1 — a(w'ey2 — w'sy-3) 


be nonnegative definite when (35) is satisfied with strict inequality. 


From the form of conditions (35) and (39), it is clear that the most 
favorable realizability conditions for a given Z(s) are obtained when 
* It can easily be shown that the reactance function property of the expression 


in (7) implies that the denominator in (387) does not vanish identically in s when 
(35) is satisfied with equality. 
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n(s) is chosen to satisfy [w./(sw.)]o = 1 and to simultaneously mini- 
mize* (w'ep—2 — W' op-3)- 


This obviously requires that [v./(sv.)],, 2 1. We wish to establish 
Lemma 6: The minimum value of (w'sp2 — wW'ep-3) 18 
1 Vom—1 ; Vom—3 [fe | | 1 | Com—1 . 
= = lies ie Se | he 
ae E ( Vom ) | Vom = Vom Vom 12 Vom 


. 6 
and is attained when n(s) = X | s+ (1 — ‘nt) | wherer is a positive 


2m 











constant and 


Vom—-1 


< 





6 = 1, 


Vom 


5 = 0, Vam—1 


Vom 
5.1 Proof of Lemma 5 
Denote by s1 ,82,..., Sep the zeros of w(s). Using a result!* due to 
Newton, we find that 
2p 
pe Sk = (wep) ap 3(W'2p-1) (W'2p-2) = 3(W'sp-s) (40) 


Recalling that here w’s,-1 = 1, 
2p 
t 3 
W'op-2 — Wop 3 = 3 +3 os Sk 
k=1 


=$+3 die +32 se 
” ” 


(41) 


where >> and >> denote respectively sums taken only over those in- 
v n 


dices corresponding to zeros of the polynomials v and 7. Hence our 
problem reduces to determining the strict Hurwitz n(s) such that >> s,° 
n 


is minimized subject to 


a= 3/1] 50 
o] 


Vom 


Of course when >> s, = 0, 7 is simply a constant. Assume then that 
n 


* Note that [s?Z.] is nonpositive definite. 
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>. se < 0. First observe that the real-part of (— g + jh)* (g, h real 
2 


constants; g positive), exceeds — g’ for all h = 0. Hence the optimum 
n has only real zeros. Next note that d qT » sx|* when each 


Ss, is a negative constant. Thus the eat nn nis a Snel linear factor 


fei) 
Vom 


in which ) is a positive-real constant and the corresponding minimum 
value of (w'sp-2 — wW’ep-3) 18 ¢1 , Where 


= 1 1 4 1 Vom—-1 ' 
ptghe -y(t- Se). aa 


Vom 





Iexpression (42) can be written in the more convenient form given in 
Lemma 5 by using (40). 
The results of this section can be summarizcd as follows. 


Theorem 5: The matrix Z(s) is realizable with an impedance z(s) = (as” + 
as + 1) (s +1)", where a > 0, if and only if 





3. If Ll — ag, = 0, [s’Zelep = 0 
4. If1 —ag: > 0,[(1/s)Z],, + [s’Zel., le /(1 — ayr)] 


must be nonnegative definite, where 


1 Com—1 : Vom—3 Pant | tn | 1 | Vom—-1 ; 
cere a ae =a pet ef eg ee 
" 3 | ( Vom ) | Vom a | Vom Vom 12 Vom 


and the v, are defined by v(s) = dois". 








Further, if Z satisfies these conditions, Z(s) given by (22) is realizable 
with » = 1 when (v2m)(Vom-1) = 1 and with 


n(s) =s+a(i — tat) 
Vom 


when (V2m)(Vom1) > 1. 
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5.2 Realizability with z(s) = (s + 1) (as? + as + 1) 


Here condition (7) requires that 


—1 
SWe — Wo =|-as+ We | 








asw, — (as? + 1) w Wo — SWe 
. 2 
must be a reactance function and asv, — (as + 1)v. ¥ 0. Thus, 
WwW , / 
-} =1, and 1 — a (Wap4 — Wap2) = 0 (43) 
SWe lac 
2 1 k —1l 
where w = S725" u,s” and wy! = wel Wop) - 


Condition (77) requires that 


] 1 
el) 
| Wo — ail, | (Wap-1 — Wep-2) il, 


exist, which is valid if and only if [s*Z.],, exists. According to Lemma 4, 
condition (777) is satisfied. 

A moments reflection, in view of the two expressions in (48), will 
show that the determination of the polynomial y(s) which leads to the 
weakest realizability conditions on Z(s) is essentially the same problem 
considered in the last section. The final result reads 





Theorem 6: 


The matrix Z is realizable with an impedance z(s) = (s + 1)(as” + 
as + 1)‘, where a > 0, if and only if 


i. || =: tt > 1 as, — (as” + 1), 4 0 


SVe 





2. [s‘Z.]., exists 


3. 1 — amy, 2 0 














where 
3 3 
1 Vom Vom—2 Vom Vam—1 1 Vom 
@=;/1—-(—")|-?+ —-=|1- 
3 Vom+1 Vom+1 Vam+41SLV2m+41 12 Vom+1 
2m+1 k 
and v(s) = Dro U8’. 


Further, if Z satisfies these conditions, the corresponding Z is realizable 
with n(s) = 1 when (vom41)(V2m) = 1 and with 


ns) = e439 [1-2 | 
2m+1 
when Creer el a) ae > 1. 
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VI. CONCLUDING OBSERVATION 


It is of interest to note that the conditions presented in Theorem 5 
[ze = (as + as + 1)(s + 1) ] reduce to those of Theorem 
3 [2 = (s + 1)7’] as @ approaches zero. However, a similar situation 
does not occur with respect to Theorems 4 and 6, for here the behavior of 
the matrix of even-parts at infinity is critically dependent upon whether 
a=0Q0ora>0. 
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Stimulated Emission of Bremsstrahlung 


By D. MARCUSE 
(Manuscript received May 31, 1962) 


A formula for the probability of stimulated emission of Bremsstrahlung 
ts derived. It ts shown that stimulated emission occurs if the incident elec- 
trons travel in a direction roughly parallel to the electric field vector of the 
wave stimulating the emission. 

Emission from free electrons 1s used in electron tube devices. The purpose 
of this paper is to show that stimulated emission occurs already in the 
elementary process of the encounter of one electron and one nucleus or ton. 
There ts no need for a slow-wave structure or elaborate electron bunching 
and no need to consider phase relationships. This elementary effect of 
stimulated emission should lead to a type of oscillator and broadband am- 
plifier working without slow-wave structures or need for the close mechanical 
tolerances of high-frequency klystrons. A device of this kind may be noisier 
than a conventional maser. 

It may be that the effect discussed in this paper is responsible for some 
of the hitherto unexplained semiconductor diode oscillations which have 
been reported in the literature. 


I. INTRODUCTION 


Stimulated emission of radiation from atoms or molecules is the 
process by which a maser operates. All masers use the transitions between 
bound states for their operation. 

However, it is also well known that radiation can be emitted from a 
free electron in the presence of a static electric or magnetic field. The 
presence of this static field—for example the field of a nucleus—is 
necessary to simultaneously conserve energy and momentum during the 
emission or absorption process. Free electrons far from any other field 
can neither emit nor absorb photons of an infinitely extending radiation 
field because if they did, conservation of energy and momentum would 
be violated, as can easily be shown. Free electrons can only scatter 
photons, a phenomenon known as the Compton effect. 

The emission of radiation from an electron passing by a nucleus is 
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known as Bremsstrahlung. It will be shown in this paper that stimulated 
emission of Bremsstrahlung is possible if the incident electron travels 
more or less parallel to the electric field vector of the stimulating radia- 
tion field. The electron absorbs radiation if it travels more or less per- 
pendicular to the electric field vector of the stimulating field. 

Since stimulated emission of Bremsstrahlung exists, amplifiers and 
oscillators may be constructed using this effect. Stimulated emission 
of Bremsstrahlung does not require any bunched electron beams or 
observation of phase relationships. Moreover, it works better with slow 
than with fast electrons, and no traveling-wave structure is required 
since the fields for the stimulation process can be confined in a cavity. 


II. STIMULATED EMISSION OF BREMSSTRAHLUNG 


The theoretical principles required to derive the probabilities for 
stimulated emission or absorption from free electrons in the presence of 
a Coulomb field can be found in the textbook by Heitler.’ We limit 
ourselves to nonrelativistic electron velocities and derive the probability 
for transitions between the following two states. The initial state consists 
of a free electron represented by a plane wave existing in the presence 
of a Coulomb and a radiation field with a certain number n of photons 
in a particular mode, while all other modes are empty. The final state 
consists of the same electron with different energy and momentum and 
with a number of n + 1 photons in the case of stimulated emission, or 
n — 1 photons in the case of absorption. 

The transition probability per unit time is given by” 


2 
w= = | Keo |’ pr (19% 


with % = 1.05 X 10°’ erg-sec 
2 Very , HernVuo \ 
Koa = gt @) 


FE, is the initial energy of the whole system, while E’ and EK” are the 
energies of the system in the intermediate virtual states. In the transi- 
tions to the intermediate states, energy need not be conserved. However, 
energy conservation is certainly required between the initial and final 
states of the whole system. 


y} 
Be ee ies (3) 
Qn 


* A list of symbols is given below in Section VII. 
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42 

E' =F + (n+ Iho (4) 
y2 

ER” = _ +. ni. (5) 


The (+) or (—) signs in (4) refer to emission and absorption respec- 
tively. p = mv is the momentum of the electron and w is the angular 
frequency of the electromagnetic field. 

The summation in (2) extends over all possible intermediate states. 
The matrix elements of the interaction Hamiltonian are given by™* 


Hn» = 1 e 2Qrh [vn + i i / gtk! ty, FOr pike r gt, 











—VEmV oe | va JB . 
— Mie he) ¢ 4/2 vn eT ee ee 
TER mV a Yin |B neat Baar Bie 
_— 1 ie , [xh [ae ied / ee eee 
Pa Va A Se 
_ — Rilke FB.) @ | /2ah [Vn+] (7) 
Vis m wo | Vn 


Oke" ke F448, Ok," ky FP 4B, Ok,".b, P48, 


The 6’s are the Kronecker 6-symbols. The k’s and 8’s can assume only 
values of the form 27n’/L with integer n’ as a result of box normaliza- 
tion. e and m are the charge and mass of the electron, n is the number of 
photons in a large box of volume L’ (box normalization). The upper 
values /n + 1 and (—) signs belong to the emission case, while the 
lower values »/n and (+) signs belong to the case of absorption, 6 is 
the propagation vector of the plane electromagnetic wave with 6 = 
|8| = w/c, k is the propagation vector of the plane electron wave with 
h |k| = mz, and p, is the component of the momentum operator of the 
electron in the direction of the electric field vector of the radiation field. 
We choose as z-direction the direction of propagation of the stimulating 
light wave 





6 = (0,0,8). (8) 


The direction of polarization (direction of electric field vector) is taken 
to be the x-direction so that 


* Electrostatic ¢.g.s. units will be used throughout this paper. 
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0 
,= -h—. 9 
p th (9) 
In comparing the matrix elements (6) and (7) with Heitler’s formulas, 
it has to be remembered that Heitler writes all momenta multiplied by 
the velocity of light c. He also drops the normalization factor L’ taking 
his box as being of unit volume. The integration extends over the box of 
volume L’. 
The matrix elements of the Coulomb potential are given by 
k 3 | gr age og Ze 4a 
= — Z 4 ed — — ———— 
Wan 7348 lr-n| |e —k’- 
La / Ci i ae 


Va ay |r—fel- eS Te eee (11) 


(10) 


The result of the integration holds in the limit L — « and can be found 
in Ref. 4. Z is the number of charges of the nucleus, and r — r, is the 
distance between the point of integration and the nucleus. The factor 
pr in (1) is the number of final states per unit energy range of the electron 
after scattering. It is 


mL k* 
pr = 72 (Qn) (12) 
with ik” = mvp the momentum of the electron after scattering and 
dQ = siny dy da (18) 


the element of solid angle of the electron scattered in the direction y and 
a. The relative orientations of y and a@ and the angles @ and ¢ of the 
incident electron are shown in Fig. 1. 

The form of the matrix elements (6) and (7) contains a physical 
approximation. We have limited ourselves to an expansion of the electron 
wave function in terms of plane electron waves, as is apparent from the 
factors e+"* appearing under the integrals. Using plane waves to de- 
scribe the electron corresponds to the Born approximation, which holds if 


Ze 
Qn Bp. <K 1. 


Our probability function w, of (1), describes the differential probability 
per unit time that an electron incident with angles 6 and ¢ emits (or 
absorbs) a photon into the existing plane wave carrying n photons, and 
that the electron is found with energy 


Amp = 4mv, =F hw (14) 
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Z 





Fig. 1 — Relative orientation of the incident and scattered electron with 
pace to the direction of propagation and the electric vector of the radiation 
eld. 


traveling within the solid angle dQ in the direction y,a. To obtain the 
total probability W per unit time regardless of the direction of the scat- 
tered electron, we have to integrate w over all directions of scattering. 
Before proceeding further, we have to discuss the influence of the box 
normalization. The size of the box is arbitrary. The results become 
independent of the box if its sides L become infinitely long. It is ap- 
parent that as L — © we get w — 0. To avoid this difficulty we consider 
that for a box of finite size we get as the number of emitted (or absorbed ) 
photons per second . 


AN = WN.” 


if N.” electrons are present in the box. Introducing the number N,’ 
of electrons per unit volume we get 


AN = WLN,. 


Calling N, the number of incident electrons which per second fly through 
the unit area at speed v, , we obtain 


Vee 
AN =W—N.=<cNe. (15) 


Vo 


The quantity o defined by (15) is the scattering or interaction cross 
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section. This name is justified by the observation that o has the dimen- 
sion of an area. 

The scattering cross section is, according to (15), defined by « = 
W(L’/v,). If we use the differential probability w of (1) instead of W, 
we obtain the differential scattering cross section 


3 


dc =w—. (16) 


Using all the equations from (1) to (15), we obtain from (16) the dif- 
ferential cross section 
_ 82e'ZNk"dQ ke 
MMe Es = a | |e _ al 
( 2m 


kn” - 


lens “| fee” | | 
2m 


The summation over all energy states reduces to one term because of the 
6 symbols in (6) and (7). We have taken 


oO 


da 


(17) 


N=-— (18) 


with N being the number of photons per unit volume. The normalization 
factor L has been eliminated from (17) and we can safely let L — «. 
We see that the two factors (n + 1)/L* for the emission case and n/L* 
for the absorption case become identical as L — o. The term 1 in 
n + 1 is related to the spontaneous emission of radiation. Since we 
calculate the transition probability (or scattering cross section) for 
emission of radiation into one well defined mode, without allowing for 
a spread in frequency or into several closely spaced modes traveling 
within a certain element of solid angle, the probability for spontaneous 
emission must be zero as L > «.* 

Only those terms give a contribution to the summation (2) for which 


k’ =k’ +6 or p’ =p +6 (19) 
and 
k” =k" +86 or p” =p’ + fs. (20) 


* If the radiation is confined to a cavity, L? is the volume of the cavity and the 
factor 1/L3 does not go to zero. The 1 in (n + 1)/L gives rise to spontaneous 
emission of noise. 
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These relations, which follow from the 6-symbols in (6) and (7), mean 
that conservation of momentum is required even for the virtual states. 
We obtain with (19) and (20) 


ki —-k =k —k” =k —k" +. (21) 
With the help of Fig. 1 the following relation can be derived 
(ko — kP = Bl = kh? + kh” + 6 = 2k°B cos 0 & 2k"B cosy 
— 2k’k"(cos 6 cos Y + sin @ sin y cos a). 
Using (14) and (19) through (22) we obtain by integrating (20) 
QameZN f”* 


= ay [da sin y VBA | 


(on 
a Vow B?kFh' 


(22) 


sin 6 cos ¢ 
a—cos O-+4e 
a/1=F2ae sin w cos (g + 2] : 
/ T=F2ae- cos ¥ — ack }e (28) 
-[1 = ae + 4e° = € cos 0 + € W/1F 2ae cos W 
— 1/1 2ae (cos 6 cos ¥ + sin 6 sin W cos a)|”. 


The upper subscript e of o indicates emission and corresponds to the 
upper (+) or (—) signs in the equation. Conversely, the lower subscript 
a relates to absorption and corresponds to the lower signs. 

We have used the abbreviations 


Ne 
. (24) 


with \, = 27/k° being the wavelength of the electron wave and \ = 
27/8 the wavelength of the electromagnetic radiation. 


(25) 


In going from emission to absorption, all we have to change is the sign 
of «. 

In order to decide if emission or absorption of radiation will actually 
occur, we have to take the difference 


or = Oe — Ga (26) 
between the cross sections for emission and absorption. 


We do not have to evaluate the integral (23) exactly. In the range 
of physical interest we will always have e < 1 and also ae K 1 buta> 1 
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for nonrelativistic velocities. We limit ourselves to the case a = c/v > 1 
and obtain approximately* 


af 2QameN Zv |" 

ash hse? .'p 
[ aw sin y 1/1 2ae [sin 6 cos 9 (27) 
0 


— +/1¥2ae sin y cos (a + ¢)]’ 
[lFae — ~/1F2ae (cos 6 cos YW + sin 6 sin y cos a)]’ 


(e4 








The double integral can be solved. The solutions have been expanded in 
terms of ae, and the difference between oc, and oa has been taken. 
We obtain to the lowest nonvanishing order of ae = Af/mv’ 


: Lay 


ls mvs? 


2 
(3 cos’ ¢ sin? 6 — 1) nee — 2 cos’ ¢ sin’ | . (28) 


If ¢ = Oorz and if 6 = 7/2, the cross section or assumes its largest 
positive value 


6 2 2 
_ 8e'NZ [ 2mv i; (29) 


oop Lae 

The direction of the incident electron defined by ¢ = 0, 7, and @ = 7/2 
is parallel to the electric vector of the radiation field (Fig. 1). Electron 
incidence parallel to the electric vector of the radiation field gives rise 
to stimulated emission of radiation. 

Equation (28) shows that or < 0 if g¢ = 2/2. The electron is incident 
perpendicular to the radiation field and absorbs power from the field. 

Let us take another look at (28) and determine the directions of 
incidence for which stimulated emission rather than absorption results. 

Since an exact solution is hard to give, we will assume that the log- 
arithmic factor in (28) is considerably larger than the remaining factor 
in the bracket consistent with the assumption mv’/hf > 1. 

We can then give approximate angles for which or = 0. 

If we let ¢ = 0, we obtain 





2 pik Oba ay eee (30) 
Z In ame i 
hf 


The angle 0.954 in radians corresponds to 54.6°. 


* Terms with e (not ae) and terms of order 1/a are neglected. Terms with ae 
have to be kept because the difference o- — oa is proportional to ae. 
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If we let 6 = r/2 we obtain 


0.236 
2Qmv? . 
hf 


All electrons incident on the ion within a cone with a cone angle of 
approximately 108°, whose axis is parallel to the electric vector of the 
stimulating radiation field, contribute to stimulated emission. 

The number AN of emitted photons per second is obtained if we mul- 
tiply or by the number WN, of electrons which per second penetrate the 
unit area containing the nucleus with the charge Ze. If there is more than 
one nucleus interacting with the electron stream, we obtain the total 
number of emitted photons per second by multiplying the total number 
of nuclei VN, with the electron flux density N. which per second interacts 
with them, provided the density of nuclei is low. For electrons incident 
parallel to the electric field vector ¢ = 0 or z, and 6 = 7/2 we obtain 
from (29) 





g = 0.954 — (31) 





In 


6 72 2 
AN Z see e 2mv | (32) 


N miy'f? 
III. DISCUSSION 


Equation (32) shows that the ratio of emitted photons per second to 
the stimulating photon density decreases rapidly as the electron velocity 
v increases. Within the limits set by (mv’/hf) >> 1 and by 2x(Ze’/fiv) K 1 
we obtain more emission with slower electrons. The number of emitted 
photons increases also with decreasing frequency. 

It may be useful to remark on the coherence of the process. We have 
to keep in mind that (82) gives the number of photons added precisely 
to the radiation mode which stimulates the emission. Let us assume that 
all other possible radiation modes are empty, N = 0. If we ask for the 
number of photons emitted into one of the empty modes, our results 
would be proportional to 1/L*, but it is proportional to (hf/mv’) - (n/L’) 
in case of emission into the radiation mode filled with n photons. (Re- 
member the remark about (n + 1)/L’ following (18). The factor hf/mv” 
arises by taking the difference o. — o.. The zero-order term drops out 
and the result is proportional to ae = hf/mv’.) The radiation from the 
electron occurs preferentially into the occupied mode with a probability 
ratio equal to [(hf) /(mv’)]n (nis a big number even for moderate power). 
The power added to the occupied radiation mode is coherent with the 
radiation present in that mode because each photon adds precisely the 
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energy hf. We have here the same situation which exists for stimulated 
emission from bound states, with the difference that the spontaneous 
emission of bound electrons has a well defined frequency while the free 
electron radiates into a very wide frequency band. Because of the ratio 
of spontaneous to stimulated emission, we would assume that the process 
discussed here is inherently noisier than stimulated emission from bound 
states. For the latter, the ratio of stimulated to spontaneous emission 
into the occupied mode is n while it is only [(hf)/(mv’)]n in our case. 
Apparently we get better results for slow electrons. 

We want to state the oscillating condition for achieving self-sustained 
oscillation if the radiation is confined in a cavity. The cavity has a 
loaded Q defined by 


NVhf _ wNV 


Q = W nh nt (33) 








with cavity volume V, photon density N, and n’ dissipated photons 
per second. The cavity oscillates if n’ = AN. With the help of (82), this 
leads to an expression for the minimum product necessary to achieve 
oscillation if we assume again that the electrons are incident parallel to 
the E-vector. 


amv f °V 


The oscillating frequency is determined only by the cavity. If many 
cavity modes can exist, oscillation will start in the mode with the smallest 
value of f’/Qz . 

For use as an amplifier the bandwidth could be much larger than the 
bandwidth of conventional masers because there is no built-in resonance 
in this process to limit the frequency. 

To build an amplifier or oscillator using stimulated emission from free 
electrons, we want to shoot dense electron beams of low energy through 
as dense an ensemble of ions as possible. This can be done by using ion 
beams rather than ions in a plasma because in a plasma scattering of the 
thermal plasma electrons by the plasma ions would have an adverse 
effect. 

It is also possible to use electron scattering due to ionized impurities 
in a crystal to obtain stimulated emission. To apply our theory to 
electrons moving in conduction bands of crystals, we have to replace 
the electron mass m by the effective mass mer: of the electron in the 
crystal. Furthermore, we have to multiply the scattering cross section 
o by 1/e with e being the dielectric constant of the crystal. 


N.Na = 


STIMULATED EMISSION OF BREMSSTRAHLUNG 1567 


Experimentally, one has to take care that the conduction electrons 
move predominantly in the direction of the electric vector of the radia- 
tion field. This is achieved, for example, by lifting electrons into the 
conduction band with the help of the photoelectric effect using a crystal 
which in the dark is an insulator. The electrons will move predominantly 
in the direction of the electric vector of the pump light provided that 
this is polarized. 

Another way of generating electrons moving in preferred directions 
is by the use of de electric fields which exceed the breakdown voltage 
of an insulating or poorly conducting crystal. 

The crystal will have to be cooled to increase the collision time for 
electron collisions with the vibrating lattice which have an adverse 
effect on the ordered electron motion and may not give rise to stimulated 
emission.* 

Another way to achieve stimulated emission of Bremsstrahlung is the 
use of crossed electron and ion beams. These two beams can be made to 
cross in a capacitor which is part of an LC resonant circuit. The use of 
beams in vacuum will limit the application of our effect to low-frequency 
amplifiers and oscillators because of the limitation in available ion densi- 
ties. 

Equation (34) can be applied to the case of an LC circuit. In this 
case, V is the volume of the capacitor and Q, the loaded Q of the LC 
circuit. An independent calculation has shown that (34) can be obtained 
if the voltage and current in the LC circuit rather than the electric and 
magnetic field in the capacitor are quantized. The photons in this case 
are the units of energy Af which are stored in the resonant circuit. 


IV. NUMERICAL EXAMPLES 


t. We assume an LC circuit with Q;, = 30 tuned to 70 me, a capacitor 
volume V = 12.5 cm’. The electrons are assumed to be accelerated by a 
potential of 10 volts, corresponding to a velocity of 2 X 10° cm/sec and 
to drift through the capacitor plates, which are made out of a wire mesh. 
To obtain oscillations, we obtain from (34) (Z = 1 is assumed) 


N.N, = 4.6 X 10” sec em” 


If we assume that an electron beam with 10 ma/cm’ current density, 
corresponding to NV, = 6.3 X 10° sec ‘ em’ ”, is employed, N, = 7.3 X 
10” ions are needed in the interaction region to make the circuit oscillate. 


*So far we have studied stimulated emission in the presence of Coulomb 
potentials. Whether other scattering potentials can be used will have to be de- 
termined by another study. 
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Assuming that the capacitor is made of a mesh of 5 X 5 cm’ with a 
spacing of 0.5 cm, and assuming further that the ions are accelerated by 
a 10-volt potential, we find that the ion current of 54 ma has to pass 
through the capacitor (Cs ions assumed) in order to provide N, = 
7.3 X 10” ions in the capacitor at any instant of time. 

a. We consider a solid with a dielectric constant « = 10, an effective 
electron mass Mere = 0.1 m and assume that the material is contained in 
a cavity with volume V = 20 em‘ which is resonant at 10 kme with a 
Q: = 10,000. The electron velocity is assumed to be v = 10’ cm/sec. 
We obtain, taking Z = 1, 


N.N,, = 2.8 X 10” sec em™” 


If we take the electron current density equal to the one used in the 
first example, V. = 6.3 X 10°° sec’ em”, we obtain as the number of 
ions in the interaction region necessary to sustain oscillations, N, = 
4.45 X 10" or a density of 2.2 X 10" ions/em’. 

The frequency could be increased further if it were possible to increase 
the ion and electron densities. Increasing the frequency to 100 kme 
increases the product N,N, by a factor of 1000. It may be possible to 
increase N, as well as N, by a factor of 30 and push the operating 
frequency to 100 kme. Increasing the Q of the cavity would also help. 


Vv. A COMPARISON WITH CLASSICAL THEORY: 


It is interesting to assume that electrons are incident from all possible 
directions with a uniform distribution over all angles of incidence. 
We obtain from (28) 


7 20 6 2 
: 32re NZ 
[ sin 6 dé ; dear == Smnivif? . 


z= (35) 
Electrons incident with a random distribution over all directions lead 
to a net absorption of radiation. This absorption process gives rise to 
attenuation of waves traveling through plasmas or semiconductors. 
However, in a semiconductor our process gives only the contribution 
of impurity scattering but not of lattice scattering. Equation (35) does 
not contain A any more and must be equivalent to classical theories. 
Equation (35) could be used to derive an expression for the attenua- 
tion constant @ in an ionized gas. However, the approximations implied 
in this work do not allow the electron velocity to become arbitrarily 
small. We will, nevertheless, use (35) to derive the attenuation constant 
per centimeter of a plane wave in a plasma with electron density n, and 
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ion density n; with the assumption that the electrons obey the Maxwell- 
Boltzmann velocity distribution. 


3/2 2 
= 2 m - mov 
f(v) = 4x0 | ea exp ( rary (36) 
It is 
AN 


where AN is the number of photons created per centimeter per second, 
c is the velocity of light, and N is the photon density. 
We obtain with 





AN = — [ of (v) & dv (38) 


from (37) with the help of (35), (36) and (38) 
as 672 5 re 
Doss ‘3 /% 1 nage tvs 1 a (39) 


3V 3 (kT)? miPfc {Qe Jy x 


The integral in (39) cannot be evaluated since it requires an integration 
over 


J m_ 
t= 04) 5p 


starting from zero. The integral is logarithmically divergent. However, 
our theory does not allow us to apply (35) for arbitrarily small electron 
velocities. Nevertheless, it is interesting to note that the part of (39) 
inside the brackets equals exactly equation (5-48) of Ref. 5 if we follow 
Spitzer’s indication and multiply his formula with 

es or S fy 

kT 

to take stimulated emission into account. The divergent factor outside 
the brackets should be one. We cannot derive its value because of the 
limitations of our approximation. 


VI. SUMMARY 


We have presented an approximate theory of stimulated emission of 
Bremsstrahlung. The theory considers the process of stimulated emission 
which occurs if a stream of electrons is scattered by one individual ion 
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or nucleus in the presence of a radiation field. The approximations require 
that 


2 
= 3551, Ie er ee ea 
v mv hv 

We have found that stimulated emission occurs if the directions of the 
incident electrons lie within a cone whose axis is parallel to the electric 
vector of the stimulating radiation and whose cone angle is approximately 
108°. 

The effect is fairly weak, so that rather high ion and electron densities 
are required to achieve substantial amplification or oscillation. 

It is predicted that the effect will work best in a solid when the conduc- 
tion electrons are scattered by ionized impurities. 

It may be that this effect is responsible for the recently reported 
oscillations which were obtained with the use of semiconductor diodes 
and which seem presently to be unexplained.®:78 

It is possible that other scattering potentials may lead to stimulated 
emission. We have restricted ourselves to Coulomb scattering. It may 
be worthwhile to study other scattering potentials, for example electron 
scattering by the vibrating lattice in semiconductors. 


Vil. LIST OF SYMBOLS 


a used as azimuth of the scattered electron 
relative to the incident electron; also at- 
tenuation constant 

B= w/e propagation constant of electromagnetic 
wave 

Cc phase velocity of electromagnetic wave 

e = 4.803 X 10°” e.s.u. charge of the electron 

f frequency of the electromagnetic wave 

¢ azimuth of the incident electron 

h = 6.624 X 10°” erg-sec. Planck’s constant 

h = h/20 = 1.054 X 10°” erg:sec. 

Hy matrix element of radiation field 

k propagation constant of electron wave; also 
Boltzmann’s constant = 1.38 x 107° 
erg: degree * (appears only in the combin- 
ation kT’) 

L length of fictitious box used for box nor- 
malization 
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m = 9.107 X 10° * gram mass of the electron 
N; ion density 
Ne electron density 
Ne number of electrons penetrating the unit 
area per unit time 
N,, number of ions or nuclei 
N photon density 
Q solid angle 
w = 2nf angular frequency of the electromagnetic 
wave 
p = hk momentum of the electron 
y polar angle of the scattered electron 
Qr loaded Q of the resonant cavity 
Pr density of physical states per unit energy 
o interaction cross section 
T absolute temperature 
6 polar angle of the incident electron 
v electron velocity 
V ro matrix element of the Coulomb potential 
w differential transition probability per unit 
time 
W transition probability per unit time 
Z number of elementary charges of the ion. 
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Diffusion Length Measurement by Means 
of lonizing Radiation 


By W. ROSENZWEIG 
(Manuscript received June 6, 1962) 


Penetrating radiation in the form of high-energy electrons, heavy par- 
ticles, and gamma rays may be used to determine minority-carrier diffusion 
lengths in semiconductor materials containing junctions by measuring the 
radiation-induced short circutt current. The electron-beam method yields an 
accurate absolute determination of diffusion length once the carrier-genera- 
tion rate as a function of depth in the material has been measured. A series 
of such experiments 1s described for silicon solar cells utilizing electrons 
ranging in energy from 0.61 Mev to 1.16 Mev. A resultant maximum 
generation rate of 2.25 X 10° + 5 per cent carriers/cm per incident 1 Mev 
electron ts obtained at a depth of 0.096 gm/cm*. Measurements with 16.8 
Mev and 180 Mev protons, and Co® gamma rays are found to be in good 
agreement with the electron-beam measurements. An experimental ar- 
rangement 1s described which yields rapid and accurate diffusion-length 
measurements of solar cells under conditions in which radiation damage 1s 
negligible. 


1. INTRODUCTION 


The minority-carrier diffusion length is an important quantity char- 
acterizing the properties of a semiconductor material or device, par- 
ticularly in any situation involving the transport of charge by minority 
carriers. It is related to the steady-state minority carrier lifetime through 
the expression L = +/Dz, in which D is the diffusion coefficient and + 
is the lifetime. A variety of methods have been employed to measure 
diffusion length in the steady state, or lifetime under transient condi- 
tions and are reviewed by Bemski.! The steady-state methods have a 
particular advantage in that the measurement is uncomplicated by 
trapping effects. One particular steady-state method suggested by 
Gremmelmaier? involves the use of gamma radiation to generate excess 
carriers at a known rate in a pn junction device. The radiation-generated 
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short-circuit current may then be used to calculate diffusion lengths 
when the device geometry is known. 

It is the purpose of this paper to discuss, more generally, the use of 
penetrating ionizing radiations for diffusion-length measurement. Ex- 
perimental results will be presented for energetic electrons and protons 
as well as gamma rays. It will be shown that of the three, electrons lend 
themselves readily to an accurate absolute determination, with an ac- 
curacy of -+-5 per cent having been achieved. Even though the method 
is quite general it will be treated in terms of one device, the silicon solar 
cell. It is the radiation damage study of this device which led to the 
development of the technique. 

In the text which follows, Section II will be devoted to the theoretical 
background for the diffusion-length determination. Section III will be a 
description of the energetic electron measurements and the analysis 
which yields the absolute calibration. Sections IV and V will deal with 
protons and gamma rays, respectively, and give experimental results 
which are compared with the electron beam measurements. A brief 
discussion of the radiation damage problem will be given in Section VI 
along with some quantitative information which shows that measure- 
ments can be made in most practical cases with negligible damage. 


Il. THEORY 


Radiation incident on the surface of a material having a shallow 
diffused junction, such as a solar cell shown schematically in Fig. 1, 
will generate excess carriers at a rate which is a function of the depth 
from the surface. The function depends on the nature of the radiation. 
That portion of the carriers which is produced deep in the bulk of the 
material can be collected only if the carriers diffuse to the junction. 
Thus, for deeply generated carriers, the bulk diffusion length is of 
primary importance in determining the collection efficiency. In fact, 
because of the very shallow junction in solar cells, the diffusion length 
is the only parameter which governs the collection efficiency for pene- 
trating radiation, the front layer becoming important only for non- 
penetrating radiation. For the case in which excess carriers are gen- 
erated uniformly throughout the material at a rate of go carriers per 
cm’ per second, the collected current density, J;,, under short circuit 
conditions is given by 


ey — egoL (1) 


where e is the electronic charge. (Derivation can be found in the Ap- 
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Fig. 1 — Schematic representation of the silicon solar cell. 


pendix.) Equation (1) indicates that the quantitative determination of L 
requires a measurement of current, under short circuit conditions, when 
a known area of the device is ‘alluminated” with radiation which is 
capable of producing excess carriers uniformly throughout the bulk of 
the material and at a known rate. Light which has been filtered with 
material which is identical to that of which the cell is made (silicon in 
this instance) would satisfy the requirement of uniform generation rate. 
However, the requirement of knowing quantitatively the generation 
rate is difficult to meet, because the amount of reflection of the incident 
light depends on the condition of the surface, and the generation rate 
due to the transmitted beam depends strongly on the exact width of 
the energy band gap of the filter relative to that of the device. 

The types of radiation which satisfy both needs are energetic nuclear 
particles, electrons, and gamma rays. As these radiations penetrate the 
material, a major portion of their energy is spent in causing transitions 
of valence electrons to the conduction band, i.e., the generation of hole- 
electron pairs. A small fraction of the energy produces damage by 
creating lattice vacancies and interstitials through nuclear collisions 
resulting in a reduction of carrier lifetime and change in carrier con- 
centration and mobility. The rate at which silicon is damaged by the 
various types of radiation will be briefly discussed in Section VI with 
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the conditions necessary to produce negligible ugmaee during a meas- 
urement of diffusion length. 

The average amount of energy required to peulaee a hole-electron 
pair in silicon has been measured to be 3.6 ev, a number which appears 
to hold well over a wide range of particle energy and types.? The de- 
termination of go for a particular case thus requires a knowledge of the 
rate at which energy is given up to the material. Moreover, to assure 
oneself of the uniformity of the generation rate, it is also necessary to 
know its rate of change with depth into the material. If the latter 
quantity be designated by g: , then it is shown in the Appendix that the 
nonuniformity can be neglected to the extent that (g:/go)L is small 
compared to one. 

Tor the cases of the charged particles it is convenient to introduce 
one other quantity, the average specific ionization, which is the average 
spatial rate of hole-electron pair production per incident particle. If the 
generation rate, at some depth in the material, is g, due to N charged 
particles normally incident per second per cm? of surface, then the 
average specific ionization, S, at that depth is given by 


S = = pairs/em. 2 

£ pairs/em (2) 
For the situation of uniform carrier generation, (1) may be rewritten in 
the form 


5 = JSoL (3) 


where J is the incident current density for singly charged particles, and 
So is the uniform specific ionization. 


III. ELECTRONS 


Of the charged particle radiations, electrons can be handled most 
easily experimentally. This is due to the fact that accelerators for 1 
Mev electrons, which are sufficiently penetrating for the purpose at 
hand, are relatively simple in design and can be operated quite reliably. 
Beam intensity calibrations can be carried out accurately, for electrons, 
with Faraday cups of not too complicated design. This situation is in 
contrast to heavy particle accelerators which are large and complex 
installations. 

The theoretical calculation of the average specific ionization as a 
function of depth per incident electron is difficult.’ However, it is easy 
to evaluate this function experimentally, as was done, in the following 
way: A pre-bombarded silicon solar cell was exposed to a broad beam of 


— 
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monoenergetic electrons normally incident to its surface. The reason for 
the pre-bombardment was two-fold: (27) the diffusion length of the 
minority carriers was reduced to only a few microns so that the spatial 
variation of ionization in the sensitive layer was negligible, and (iz) 
the diffusion length did not change appreciably during the measure- 
ments. The solar cell short-circuit current density, Js. , was measured 
and the electron beam calibrated to obtain the incident electron current 
density, J. . By forming the ratio J,-/J- one has determined the average 
ionization per incident electron in the spatial interval a distance LZ from 
the junction. This quantity was then obtained as a function of depth by 
placing absorbers, approximately equivalent in atomic number to the 
cell material, on top of the cell and measuring the resultant short circuit 
current. The placing of the absorbers on the cell was equivalent to mov- 
ing the sensitive layer, in which the ionization was measured, through 
the volume of the material. Care was taken in this procedure that the 
absorbers overlapped the edges of the cell sufficiently so that electrons 
scattered out at the periphery of the cell were compensated by equivalent 
electrons scattered in. 

The results of a series of such experiments are shown in Figs. 2-5 for 
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Fig. 2 — Ratio of collected current density to incident electron current density 
vs depth for 1.16 Mev electrons in a silicon solar cell of 8.75 » diffusion length. 
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Fig. 3 — Ratio of collected current density to incident electron current density 
vs depth for 0.98 Mev electrons in a silicon solar cell of 8.75 » diffusion length. 


incident electrons of 1.16 Mev, 0.98 Mev, 0.80 Mev and 0.61 Mev 
energies respectively. They are plots of the ratio J s:/J- versus depth in 
silicon. (The actual absorbers used were made of aluminum, which is 
sufficiently close in atomic number to silicon to require only a density 
correction to yield equivalent thickness of silicon.) Since the integral 
over-all depths of the average specific ionization, S. , per incident electron 
is equal to the total ionization which the electron produces in the silicon, 
then by means of (8) one gets the following result: 


es | ee ae E 
.dx = 2 | = a 4 
i pe Se se (4) 





where J is the average energy absorbed in the silicon per incident elec- 
tron and e = 3.6 ev/pair. 

This average absorbed energy is very nearly equal to the energy of 
the incident electron. The latter quantity was measured with a com- 
bination vacuum Faraday cup-calorimeter, under scattering conditions 
of 2 mils of aluminum and 2 inches of air. The absorption measurements 
were made with additional scattering of 3.5 mils of aluminum and 3 
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Fig. 4 — Ratio of collected current density to incident electron current density 
vs depth for 0.80 Mev electrons in a silicon solar cell of 8.75 » diffusion length. 


inches of air. The extrapolated ranges obtained from these curves, when 
corrected for the additional air and aluminum, give initial electron 
energies, through the range-energy relationship,’ which agree with the 
calorimeter measurements within 3 per cent. Using the energies ob- 
tained by the extrapolated ranges and evaluating the integrals under 
the smooth curves, one obtains a set of L values for the cell used in 
these measurements as in Table I under the heading Uncorrected values. 
One observes that the caleulated L values decrease with decreasing 
energy. The cause for this is not clearly understood, but it 1s possible to 


TasBue I] — Dirrusion LENGTH VALUES COMPUTED FROM ABSORPTION 
CurvESs For ELEcTRONS OF VARIOUS ENERGIES 








Electron Energy (Mev) Uncorrected L{(p) Bacxscatter Corrected L(p) 
1.16 8.34 8.54 
0.98 8.25 8.48 
0.80 7.84 8.15 
0.61 7.64 8.04 
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Fig. 5 — Ratio of collected current density to incident electron current density 
vs depth for 0.61 Mev electrons in a silicon solar cell of 8.75 » diffusion length. 


enumerate two contributing factors. The average energy of the incident 
electrons is less than that determined by the calorimeter under less 
severe scattering conditions. Similarly, the extrapolated range is only a 
measure of the range of the most energetic components, the less energetic 
ones manifesting themselves only in the shallower portion of the ab- 
sorption curve. Secondly, a certain fraction of the incident energy flux 
is reflected back by way of secondary electron emission. The latter 
effect can be taken into account quantitatively by making use of some 
recently published data by Wright and Trump.* They give curves for 
the fraction of the incident electron beam energy which is scattered 
backward as a function of energy. When one makes use of the curve 
for aluminum, one arrives at ZL values shown in Table I under the head- 
ing Backscatter Corrected values. The variation with energy has been 
reduced but the trend still remains. If one makes the assumption that 
the correct L value is approached as the electron energy goes to infinity 
then one can attempt to obtain this value by plotting DL against the 


DIFFUSION LENGTH MEASUREMENT 1581 


reciprocal of the energy and extrapolating to zero. Such an extrapolation 
yields L = 9.0 » which is about 5 per cent higher than the value at 
0.98 Mev. If one now takes a value halfway between these two, namely 
8.75 pw, any remaining uncertainty will introduce errors of less than +3 
per cent. 

With the L value of the experimental cell thus determined, it is 
possible to normalize the ordinates of the curves in Figs. 2-5 to give 
the average specific ionization as a function of depth for the various 
initial electron energies. For example, 0.98 Mev electrons have a maxi- 
mum average specific ionization in silicon of 225 pairs/u and, for the 
scattering conditions of this experiment, have a surface ionization of 
152 pairs/u. The initial slope of the curve is 2.85 pairs/y? so that for a 
cell of 100 u diffusion length, g:L/go = 0.38 (see Appendix). 

The above nonuniformity in the carrier generation rate is large 
enough to make it desirable to make diffusion length measurements in 
the vicinity of the maximum where the variation in the generation rate 
is less than 2 per cent over a range of 250 uw. An experimental arrange- 
ment for readily doing this has been developed and takes the form of a 
double-aperture Faraday cup, which is shown schematically in Fig. 6. 
The cup is placed in the 1 Mev electron beam in a position such that 
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Fig. 6 — Schematic representation of the double-aperture vacuum Faraday 
cup for measuring Jec/Je . 


1582 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1962 


equal electron fluxes enter both apertures. This is assured by making 
repeat measurements with the cup rotated 180° around its own axis. 
Under one aperture is a 12 mil aluminum foil, bringing the carrier 
generation rate near the maximum in silicon, which is followed by the 
cell whose diffusion length is to be measured. The beam entering the 
other aperture is collected in a Faraday cup. The ratio of the cell short 
circuit current to the cup current when divided by 225 will give the 
diffusion length in microns. Such measurements are carried out with suf- 
ficiently sensitive amplifiers so that an LZ determination can be made 
with an exposure of 10!° electrons/em?. The ratio is formed electroni- 
cally and is displayed in digital form. The measurements are reproduc- 
ible to within +3 per cent. 


1V. HEAVY PARTICLES 


For the case of energetic protons, deuterons, and alpha particles the 
situation is fairly simple. Since the particle mass is much greater than 
the electron mass, the incident particle passes through the material 
with only negligible deflection except for the extremely rare instance in 
which it undergoes a large angle nuclear collision. At any depth in the 
material, the specific ionization may be obtained by dividing the rate 
of energy loss, dH /dx, by the average energy required to produce a pair; 
i.e., 3.6 ev for silicon. 

As an example, Fig. 7 is a plot of the specific ionization and its spatial 
rate of change in silicon as a function of proton energy. The values were 
computed from a tabulation of dH /dx versus energy in aluminum as 
given by Sternheimer.’ These curves show that for minority carrier 
diffusion lengths of the order of 100 microns, a proton of initial energy 
greater than about 18 Mev will generate carriers uniformly to within 
2 per cent in a depth of one diffusion length from the surface. This im- 
plies a 2 per cent accuracy in (1) when a uniform generation rate of gc 
is used. The uniformity of carrier generation is improved with increased 
proton energy. 

Similar considerations may be applied to other heavy nuclear par- 
ticles. For quantitative information on dH/dx and range-energy rela- 
tionships, references such as Bethe and Ashkin® or Allison and Warshaw’ 
are useful. 

Diffusion length measurements were made with 16.8 Mev protons 
(Princeton Cyclotron) and 130 Mev protons (Harvard Cyclotron). The 
ionization currents were measured on cells of 1 cm? area and the beam 
current densities measured with IF'araday cups available at each of the 
two installations. When (3) is solved for L with Sy = 1.46 x 10’ pairs/ 
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Fig. 7 — Specific ionization and its spatial rate of change vs energy for protons 
in silicon. 


cm at 16.8 Mev and Sy = 3.03 X 10° pairs/cm at 130 Mev, the results 
in Table II are obtained. For comparison, the table also contains the 
diffusion length measurements obtained with the electron beam. As 
may be seen, agreement is obtained between the two methods for a 
large range of diffusion length values for both conductivity types and 
at two greatly different proton energies. 


V. GAMMA- AND X-RAYS 


Measurement with this type of radiation is discussed in the previously 
mentioned article by Gremmelmaier.? He shows that to a good degree of 
approximation, 


E 


go = NK (5) 
where 
N, = incident radiation flux in quanta per cm?-sec, 
kK = the absorption coefficient, 
EK = the average electron energy per absorbed quantum, 
€ = average energy per pair (3.6 ev for silicon). 
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TasBue IJ — Dirrusion LENGTH MEASUREMENTS OBTAINED BY MEANS 
oF Protons COMPARED WITH 1 MrEv ELEcTRON MEASUREMENTS 











Proton Ener, ( 
Cell Type Cell No. Men) ad by ie by Heehas 
nonp 211 16.8 21.0 21.5 
nonp 622 16.8 86.0 88.0 
non p 1112 16.8 29.3 82.5 
non p 1021 16.8 13.8 11.5 
ponn 5011 16.8 5.2 4.8 
ponn 5021 16.8 22.4 19.5 
ponn 280R, 16.8 8.7 7.9 
ponn 2838R 16.8 5.6 3.9 
non p 8112 130 5.4 5.5 
non p 10802 130 5.4 5.35 
nonp B-1 16.8 41.2 40.4 
non p 6112 16.8 12.0 11.5 
nonp C-2 16.8 20.4 21.2 
nonp D-2 16.8 8.2 7.8 
nonp 7112 16.8 2.3 2.3 
non p 7111 16.8 2.14 2.36 
non p 6911 16.8 2.2 2.4 
non p 7012 16.8 2.5 2.4 











For a given quantum energy, K and # can be calculated from tabulated 
absorption and scattering cross sections.!? However, the determination 
of N, for a given source and particular geometry is quite difficult. 

The use of this radiation does offer definite advantages. The rate of 
degradation by the radiation is substantially lower than for the case of 
energetic particles since the secondary electron flux is weighted toward 
the low energy end, with a sizable portion below the damage threshold 
energy. It is highly penetrating, so that measurements can be made 
through appreciable thicknesses of absorber. For the same reason, the 
carrier generation rate is uniform to great depth. The advantages of a 
gamma- or X-ray source can be utilized without the evaluation of the 
various constants in (5) by calibrating with a cell of known diffusion 
length, the latter quantity having been determined with electrons. 

Solar cell short circuit currents were measured in a 10 kilocurie Co® 
source. The radiation level within the source was about 0.87 X 10° r/ 
hr. A number of cells whose diffusion lengths had previously been de- 
termined using the electron beam were placed inside the source and 
their short circuit currents measured. The ratio of current to diffusion 
length for each of the cells is shown in Table III. As may be seen, the 
ratio undergoes variations up to 20 per cent. This is caused primarily 
because the cells did not occupy positions of equal intensity in the y-ray 
source. It is thus reasonable to conclude that there is good agreement 
between a diffusion length measurement in the y-ray source and under 
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Taste IJJ — Gamma Raptation InpucED SHort Circuit CuRRENT 
NORMALIZED TO 1 Mev ELEcTRON BEAM DETERMINED 
DIFFusion LENGTH 














Cell Type Cell No. L(u) “na 
non p 2A 18.6 174 
non p 2B 19.0 159 
non p 3A 17.7 141 
non p 3B 17.9 154 
ponn 5A 4.3 .176 
ponn 5B 4.4 171 
ponn 6A 7.0 .168 
ponn 6B 5.2 170 











the electron beam, regardless of whether the minority carriers are holes 
or electrons. Since the area of each of the cells used was 1 cm?, the average 
value of the ratio gives a calibration for the particular y-ray source of 
0.162 wa/y-cm?. This number may be converted to an absorbed dose 
rate" and yields 0.88 X 10° rad/hr. By means of the definition of the 
rad and the roentgen, and using the mass stopping power of air relative 
to silicon one can convert this number to correspond to 0.81 X 106 r/hr 
which is in reasonable agreement with the existing calibration. 


VI. RADIATION DAMAGE 


The damage produced by energetic radiation causes the minority 
carrier diffusion length to decrease in a well defined way. Fig. 8 gives 
the experimental results for the measured diffusion length as a function 
of 1 Mey electron flux for an n-diffused, p-type 1 Q-cm base silicon solar 
cell. The solid curve is a plot of the theoretical formula 


if 1 
p-pat ke (6) 
in which L is the diffusion length after some bombardment flux &, and 
Lo the initial diffusion length, with Io = 140 wand K = 1.8 X 107". 
The value of K depends on many parameters such as the energy and 
type of bombarding particle, the resistivity and conductivity type of 
the material, the temperature, and the impurity concentration. The 
variation of K with particle type and energy will be the subject of other 
papers. Table IV summarizes some of the results for the radiations 
mentioned in this paper and for silicon solar cells made from pulled 
crystals and irradiated at room temperature. 
From this one can determine, for example, the per cent change in 
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Fig. 8 — Degradation of diffusion length vs electron flux. 


diffusion length of an n-type base solar cell (1 2 em) during a measure- 
ment with 1 Mev electrons for an exposure of 10!° em~ when the initial 
L value is 100 uv. Differentiation of (6) yields 

dh «K 


2 


which gives, for the above case, a change of less than 0.2 per cent. 


VII. CONCLUSION 


It has been demonstrated that the method for measuring diffusion 
length by means of ionizing radiations is particularly suitable for the 
case of the solar cell and is expected to be useful for devices of similar 
geometry. Measurements with three types of radiations—electrons, pro- 


Taste TV — Rapiation DAMAGE PARAMETER K = (d/d®)(1/L?) For 
SoME RADIATIONS AND CONDUCTIVITY TYPES IN SILICON 








Base Conductivity Type 1 Mev 16.8 Mev 130 Mev 
and Resistivity Electrons Protons Protons Co y-rays 
p-type Xx 107% 8.3 X 10-7 | 3.8 10-7] 2.6 X 10°31em= 


190711 


1Qe 1.8 
p-type | 102¢ 5.8 X 
n-type | 1Qcm | 2.6 X 1079] 5.1 X 107°| 2.0 X 10°§) 4.0 K 107% 4em7? . 
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tons and gamma rays—are shown to give consistent results over a wide 
range of diffusion lengths and for both conductivity types in silicon. 
The electron beam technique is shown to lend itself most readily to an 
absolute calibration and to routine measurements of diffusion length. 
By this technique the average specific ionization of electrons as a func- 
tion of penetration depth is measured. For 0.98 Mev electrons in silicon 
this ionization reaches a maximum value of 225 pairs/u + 5 per cent 
at a depth of 0.096 gm/cm?. 

The consistency found among the various types of radiation allows 
one to reverse the argument and suggest that calibrated solar cells be 
used as solid-state ionization chambers for measuring radiation intensity. 
A range of 103 rad/hr to 10’ rad/hr should be achievable. 
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APPENDIX 


For a solar cell of the type shown in Fig. 1, for which the junction 
depth is much smaller than the diffusion length of the minority carriers, 
one needs to solve the diffusion equation only in the semi-infinite region 
beyond the junction for the case of penetrating radiation. If the junction 
is assumed to be located at x = O and the carrier generation rate as a 
function of distance from the junction to be given by g(a), then the 
steady-state excess carrier concentration will be given by the solution 
of the equation: 


pom 2 @ g(x) (8) 
dg? + ae 
where 
D = minority carrier diffusion coefficient, 


7 = minority carrier lifetime, 
nm = excess minority carrier concentration. 


Under short circuit condition, i.e., zero bias on the junction, the equa- 
tion is solved with boundary conditions n = 0, « = 0, and dn/dx = 0, 
x = o. If the carrier generation rate is reasonably uniform within a 
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diffusion length, LD = +~/Dr, of the junction, then g(x) may be repre- 
sented by the first two terms of its Taylor series expansion as follows: 


g(x) = go + giz. (9) 


The solution to (8) with g(x) given by (9) under the above boundary 
conditions is given by: 


n(x) = gr(1 — @ 7”) + gar (10) 


which yields for the magnitude of the short circuit current density: 
Joc = egol ( a - L). (11) 
0 


From (11) it may be seen that under strictly uniform carrier generation, 
i.e., g: = 0, the short circuit current density is given by Js. = egol. 
This magnitude is increased or diminished in the nonuniform case by 
an amount eg,L depending on whether the generation rate increases or 
diminishes with depth in the material, i.e., depending on whether g, is 
positive or negative. 
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Imperfections in Lined Waveguide 


By H. L. KREIPE and H. G. UNGER* 
(Manuscript received May 11, 1962) 


Thickness variations of the lining and deformations of the cross-section cou- 
ple circular electric waves to unwanted modes and degrade transmission char- 
acteristics. Generalized telegraphist’s equations for lined waveguide with 
these imperfections are found. The most critical interaction is caused by 
lining variations of circular symmetry between TE, and higher circular 
electric waves. Because of such interaction the average TE loss ts increased 
and signal transmission is distorted. More serious than signal distortion 
is the increase in average loss. In 2” I.D. copper waveguide with a 0.01” 
lining, the rms of a typical thickness variation should stay below 0.002” for 
the TE, loss at 55.5 kme not to be raised more than 10 per cent. Cross-sec- 
tional deformations in lined waveguide cause nearly the same increase in 
loss and signal distortion as in plain waveguide. Tolerances for such defor- 
mations should therefore be the same as in plain waveguide. 


I. INTRODUCTION 


Lined waveguide shows promise as a communication medium.! Cir- 
cular electric wave loss in bends is reduced by a low-loss lining. A lossy 
lining reduces the degrading effects of mode conversion and reconver- 
sion. 

In straight circular waveguide with a perfectly uniform lining, circu- 
lar electric waves will only suffer a slight increase in attenuation due to 
the loss factor of the lining and the slightly increased wall currents. 
Also, the phase constant will be only slightly shifted. Otherwise the 
transmission characteristics will remain smooth and undistorted. 

A perfect waveguide with a perfectly uniform lining, however, can- 
not be realized in practice. The waveguide will be slightly deformed, 
and any lining as it is applied by spraying, dipping, or other methods, 
will show variations in thickness along the circumference as well as 
longitudinally. These variations will, in general, be distributed ran- 
domly. 


* Technische Hochschule Braunschweig; work done under letter contract. 
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Deformations of the cross section or an imperfect lining couple the 
circular electric wave to other unwanted modes and by mode conversion 
and reconversion further increase the loss and otherwise degrade the 
transmission characteristics. To keep these degrading effects small, the 
waveguide has to be made sufficiently round and straight and the lining 
uniform. In order to specify tolerances, a theory is needed of circular 
electric wave propagation in a deformed waveguide with varying thick- 
ness of the lining. 

Attempts at such a theory have been made elsewhere, but they were 
limited to a first-order approximation’. These approximations are 
by far not adequate to describe the cases of practical interest. 

Imperfect lining in a perfect waveguide will be analyzed first. Subse- 
quently, the effects of cross-sectional deformations will be taken into 
account. 


Il. GENERALIZED TELEGRAPHIST’S EQUATIONS FOR A WAVEGUIDE WITH 
IMPERFECT LINING 


The lined waveguide, Fig. 1, will be considered in cylindrical coordi- 
nates (7, v, 2). The thickness ¢ of the lining will for the moment be as- 
sumed a function of g only. Any such function, being periodic in g, may 
be expanded into a Fourier series 


t= 401 + Tv», cos qg). (1) 
qa 


The sine terms have been omitted in (1); they would only add terms of 
orthogonal polarization. t, is the nominal thickness of the lining. 
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Fig. 1 — Round waveguide with lining of varying thickness. 
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The relative permittivity will be written 
euch 


where ¢; is the relative permittivity of the waveguide with a perfectly 
uniform lining. e. takes into account the actual permittivity distribution 
in a waveguide with varying thickness of the lining. 

More specifically: 


7 ie 1 0 < r < ao 
(2) 
€) = €; ios T << O 
where e, is the relative permittivity of the lining. Moreover, 
€: + €& = & 5 €é2 = €-— 1 inareas A (Fig. 1) 3) 
3 
gte=1; a i in areas B. 


In other cross-sectional areas eg = 0. Only the thickness of the lining 
varies; the permittivity is assumed to be constant. 

The electromagnetic field in the waveguide is described in terms of 
normal modes of the round waveguide with a perfectly uniform lining. 
These modes are derived from two sets of scalar functions 7’, and 7, 
given by (6.1) and (6.2).* 

The transverse field components are written in terms of the follow- 
ing wave functions: 








/ 

B, = 2Ve[ 2% + a, | 

n or roe 
E, = V_| oe - iy Te | 

n rog or (4) 

OT hn OT n. 
A, _ -21,| 9% a dn “73 or iz 
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The individual terms in (4) represent normal modes of the lined wave- 
guide when d, is chosen according to (6.26). k = w+/poeo is the free- 
space wave number; h,, is the axial propagation constant; and 


kn = Xn@o = ao(k? — h,?)* (5) 
is the radial propagation constant of the mode n. 


* These equations are listed in Ref. 6; the terminology (6.1) refers to Ref. 6 
equation (1), (6.2) to Ref, 6 equation (2), ete. 
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In Maxwell’s equations the distribution of permittivity (e + ¢) 
must be taken into account: 


{2 (E.) — = (rE) | = —jouoll, (6) 
| 2 a) - 2 ce) | = —sonatt (7) 
2) 2 (nt,) — 2 cn) | = —sanott (8) 
e (H,) — 2 (ra,) | a iCreia aie (9) 
[ 2) - 2 aH) | = sola + weed, (10) 
2 GH - (H,) | = jole + eect. ay 


Substituting for the transverse field components from (4) into (8) and 
(11) and taking advantage of (6.3), the longitudinal field components 
are obtained: 





2 
Hz = jweo> Vide T.,! (12) 
€1 X : 
H, = Jeopty Z Ln Tear a (13) 


To find relations for the current and voltage coefficients, the series 
representations (4) and (12) and (13) for the field components are sub- 
stituted into Maxwell’s equations. Then by multiplying with orthogo- 
nal field functions, combining some of these equations, and integrating 
over the cross section, generalized telegraphist’s equations are obtained. 

For example, 

(i 2 lige OL iy, 
—\rée ak? ar 


) « times (6) 


is added to 
(2 4 ay Bat Oe 
or "ak? rap 


) €1 times (7) 


and the result is integrated over the total cross section. Using the or- 
thonormality condition (6.29)° and the wave equation (6.3), one obtains: 
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9 2 2 
Wee 5 = Ty = jeje 2 Tq. | OE Pas (14) 
n Ss 


dee ek eee 
Similarly, 
, 
— (% + dn a) times (9) 
or Trdg 
is added to 


t 
~ (te — dn it) times (10) 
Trdg or 


and the result is integrated over the cross section 


dm + jweoVin 
dz 


OT , a7, lier: OT m. 
= —Jeren 2 Vn fie | ar + dn | E + din oF. | (15) 


dT n dTn | [ OT m re. 
a E ids ror | | = ~ dm or =: 

















Iiquations (14) and (15) are generalized telegraphist’s equations for 
round waveguide with imperfect lining. 
Introducing traveling waves, 


Les =%V Kin (Gm + Dn) 


En Pa : (Gi, = Dn) oy 





- 


with the characteristic impedance 


hm im 
Bait 3 (17) 
WED 
the more convenient form of generalized telegraphist’s equations in 
terms of amplitudes of forward (a,,) and backward (b,) traveling waves 


is obtained: 


dz. + aan Can ae J 2 [em an Cmn ba] 

adm + a : 
> — — ; 

“ae a FlnD ai J - Gea Dn Cmn Gn]. 


The coupling coefficients in (18) are 
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: 2W/Ninlin 2S & ate 
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To analyze circular electric wave propagation, it is sufficient to con- 
sider only coupling between circular electric and other waves. Let m 
denote the TEom wave, then 














T= b 
and 
Ein = Erm = Hom = 9. 
In this case the coupling coefficients reduce to 


+ eile | [a = dy Ee | ohm as 


rag "ar F) (2) 





Cmn 


Also the generalized telegraphist’s equations may now be written 
shorter: 


ae + jin Om = JZ Cma(Gn + Bn) 
db . es 
Fe Fimbm = JZ Cnn(dn + bn). 


To find the z-dependence of the wave amplitudes a, and b» for cer- 
tain initial conditions requires the solution of the generalized teleg- 
raphist’s equations (18) or (21). They are a system of simultaneous 
first-order and linear differential equations and can be solved by stand- 
ard methods. 


III. COUPLING COEFFICIENTS FOR LINING IMPERFECTIONS 


First of all, the coupling coefficients have to be evaluated. Under prac- 
tical conditions certain approximations may be made. The maximum 
deviation of the lining from its nominal thickness or also any of its 
Iourier components are assumed to be small compared to the nominal 
thickness: 


vy KI. (22) 
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A first approximation is obtained by substituting for the wave func- 
tions 7’, and T',’ in the range of thickness deviations their values at 
the nominal boundary r = a). Thus the integration in (20) is facilitated. 
e2 is different from zero only in regions A and B of Fig. 1, and in these 
regions the integrand is assumed independent of r. 

According to the boundary condition at r = ao for the internal and 
external field components 


4 4 e e€ 
Kg, Eon a Kg, Bon 


and consequently, because of (4): 




















aT’, of, ers” OT nm’ af, ota” 
nr — 7 nr m — ™m a a nr m 
E ‘ or | or -& or | or (28) 
The various terms of the integrand in (20) are: 
Of > p 
a Nod p(Kn) ay 08 PP 
u 
oa = Naxndp (kn) cos pe (24) 
iv 
OT 'm = Ninxmdo (Km). 
or 


Introducing these terms into (20) and using (23), the coupling co- 
efficients are: 


Cnn = 4 Aiimbin ImNnN nkemd1 Km) F phn) (p-dnkn'Yn) 


Q7r b—t (25) 
-(e, —1) i | [ cos py drdy 
Qo “0 “ao 


=. da ie) 
where Yn = Kind p(n) aa (26) 


Integrating over the radial coordinate r, the upper limit b — ¢ is a 
function of g. According to (1) 


b—t = a — 2 vy COS Ge. (27) 
qd 


Integrating in (25) over r and g the contributions from all Fourier 
components of (27) disappear except those with gq = p. One obtains for 
the coupling coefficients: 


Cnn = — 5 Vlnlin ImN mN nkimI skim) J phn) (p — dnkn Yn) 


- (e ee 1) dv, — 


1596 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1962 
where for the nominal thickness the relative measure 


has been introduced. 

For a thickness deviation described by a single coefficient v, , there is 
only coupling between circular electric waves and waves of circumferen- 
tial order p = q. 

Yor the term corresponding to gq = 0 the integration results in 


Cnn = — TA NmlindmN mdnN nkimi nd 1(Km)J (kn) (€r — 1)81%0. (30) 


This uniform thickness deviation only causes mutual coupling between 
circular electric waves. 

For a very thin lining, the modes of lined waveguide may be considered 
perturbed modes of plain waveguide. It has been found elsewhere’ 
that a first-order perturbation of this kind is a good approximation 
only in a very limited range. Nevertheless it will be quite informative 
to study the approximations for the various expressions when this 
first-order perturbation is introduced. 

The asymmetric modes are either perturbed TE or perturbed TM 
modes. The coupling coefficient between circular electric and perturbed 
TE modes reduces to 


2 a 


2 P 
kK Kmokno i — es oe 
Kno? ae (1 Us Kb? (e, — 5) (31) 


Cmn = = €; 1)5;°v, 
V 2hmohino ‘3 


where the second subscript 0 indicates the value of the corresponding 
quantity in plain metallic waveguide. 
The coupling coefficient between circular electric and perturbed TM 


modes reduces to 
on = 4/ eo ss kkmop ae 6:°y,. (32) 


The uniform thickness deviation causes coupling only between circular 
electric waves. For a very thin lining the corresponding coupling co- 
efficients (80) reduce to 





k?kmokno 


eee as 3 
Cmn A/Tmolind (e, 1) 6, Vo. (33) 
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In all approximate expressions (31) to (83) the coupling coeffi- 
cients vary with the third power of the relative thickness 6, of the lining 
and are proportional to the relative deviation v, of the thickness. 

To evaluate the exact expressions (28) and (30) for the coupling co- 
efficients as well as the approximate expressions (81) to (83), the prop- 
agation characteristics of the coupled modes must be known first. The 
characteristic equation for the normal modes in lined waveguide of 
perfect geometry has been solved numerically by an iterative procedure. 
The evaluation was programmed for automatic execution on a digital 
computer. 

Also included in this program was an evaluation of the exact formula 
(28) for coupling between TE; and any asymmetric mode. 

For the typical values b/X = 4.70 and 7.62 corresponding to 2” I.D. 
waveguide at 55.5 kme and 90 kme, and for a permittivity e, = 2.5, the 
phase constants of a number of modes are plotted versus the relative 
thickness of the lining in Figs. 2 to 5. 

The lining changes phase constants most effectively for TE,: modes 
and all TM,, modes. All TE,, modes with n > 1 show very little 
change for a thin lining. First-order approximations for the phase con- 
stants in case of very thin lining would be represented in these plots 
by straight lines tangent to the curves at 6, = 0. Note that these first- 
order approximations are gravely in error for most modes and any sub- 
stantial thickness of the lining. 

Some of the coupling coefficients for thickness deviations have been 
plotted in Figs. 6 to 9 as a function of the thickness of the lining. 
In the log-log plot of these figures the first-order approximations for 
thin lining appear as straight lines with slope 3. Note that these straight 
lines are fairly good approximations for coupling between TE), and 
TE,, modes with n > 1 and a relative thickness of the lining smaller 
than 1 per cent. For coupling between TE, and all other modes, how- 
ever, these approximations are again seriously in error. 

Note also that coupling between TE, and TE,, modes is always 
larger than coupling between TI; and TM,, modes. Iurthermore, 
coupling between TE»; and TE,, increases with increasing order n of 
the radial dependence of coupled modes, while it decreases in case of 
TM,» modes. 

In Fig. 10 the coefficients of coupling between TE; and higher circular 
electric waves for circular symmetric components of lining imperfection 
corresponding to p = 0 have been plotted. First-order approximations 
for thin linings are shown in this diagram only. They are adequate for 
the entire thickness range to be considered. As in the general case of 
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Fig. 2 — Phase constants of TE;, and TMi, modes in lined waveguide. b/A = 
4.70, er = 2.5. 


coupling to TE,, the coupling coefficient increases with the radial order 
n of the higher circular electric modes. 


IV. MODE CONVERSION AT IMPERFECT LININGS OF VARIOUS PERMITTIVI- 
TIES AND AT VARIOUS FREQUENCIES 


Lined waveguide will be used for circular electric wave transmission 
over wide frequency bands extending up to 100 kme and more. In 
addition, for technological reasons the lining might be made of materials 
of various permittivities. 
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Fig. 3 — Phase constants of TE:, and TM2, modes in lined waveguide. b/A = 
4.70, 6 = 2.5. 


A first indication as to how the coupling coefficients depend on fre- 
quency and permittivity of the lining is obtained from the approximate 
formulae (31), (82) and (33) for coefficients of coupling at lining imper- 
fections. 

For interaction between waves which are sufficiently far from cutoff 
we have 
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Fig. 4 — Phase constants of TE3, and TM;, modes in lined waveguide. b/A = 
4.70, er = 2.5. 


hmo = hno =k 
and 
(er — 1) ka? > kyo’? 
so that for interaction between Tlic, and TE,, (31) may be replaced by 
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DIFFERENCE IN PHASE CONSTANT TO TE,,, Ab 





: ue 5 eee constants of TEin and TMyn modes in lined waveguide. b/x = 
. > €r = wv. 
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vB (i f eC — 1) ar"bp. (34) 
Keno? 

Equation (32) for interaction between TE», and TM,, may be replaced 

by 





Cnn = 


Gus Paige (35) 


V2 er 


Equation (33) for interaction between circular electric waves may be 
replaced by 





Cnn = Kmok nok (€r _= 1) 6,30. (36) 


These approximate expressions indicate a linear dependence on fre- 
quency of all coupling coefficients. As functions of the permittivity the 
coupling coefficients are proportional to (e, — 1)/e, for interaction be- 
tween TEo, and TM,, waves, but nearly proportional to (e, — 1) for 
interaction between TEo,, and all TE waves. The term p?/e,kKno? may for 
most TE modes be neglected with respect to unity. 

For substantial linings the normal modes cannot be regarded as per- 
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Fig. 6 — Coefficients cn, of coupling between TEo: and TE;, and TMi, waves 
at lining imperfections of order p = 1. 6/A = 4.70, e, = 2.5. 


turbed modes of plain waveguide. In this case an indication of the 
dependence of mode interaction on permittivity and frequency can be ob- 
tained only by evaluating the exact expressions. 

Comparing the curves of Figs. 6 and 9, it is found that the exact 
values for coupling coefficients show nearly the same dependence on 
frequency and permittivity as indicated by the approximations. Only 
for interaction between TE, and lower-order TM modes are the cou- 
pling coefficients appreciably larger at higher frequencies even for thin 
linings. 
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Fig. 7 — Coefficients c,,, of coupling between TE: and TE2, and TMo, waves 
at lining imperfections of order p = 2. b/A = 4.70, er = 2.5. 


All these characteristics of coupling due to imperfect linings will have 
to be considered when the coupled line equations are solved for TE 
propagation. 


V. LINING TOLERANCES 


In manufacturing lined waveguide, tolerances for irregularities must 
be specified. These irregularities are randomly distributed, and at best 
some of their statistical properties are known. By solving the generalized 
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Fig. 8 — Coefficients cmn of coupling between TE: and TEs, and TM3, waves 
at lining imperfections of order p = 8. b/A = 4.70, e = 2.5. 


telegraphist’s equations, statistics of the transmission characteristics 
may be expressed in terms of the statistics of lining imperfections. 

In particular, the average added loss for circular electric waves due 
to mode conversion at lining imperfections is’ 


L 
<Aan> = Pa) R(z)(L — 2) Cnm’? COS ABnme dz (37) 
n 0 


where 
R(z) = <vp (21) : Vp (21 + 2z)> 


is the covariance of a component »,(z) of thickness deviation. »,(z) is 
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Fig. 9 — Coefficients cma of coupling between TE: and TE;, and TM), waves 
at lining imperfections (p = 1). b/d = 7.62, ¢, = 2.5. 


assumed to be a stationary random process in 2. Crm isa coupling factor 
defined according to 


Cam = CamVp(Z). 


L is the total length of the waveguide. A, is the difference in phase 
constant between the circular electric wave m and the mode n coupled 
by Cam to m. The difference in attenuation Aa, between mode m and 
n is assumed small enough so that 


eA anm? ~ 1 me 


for all z for which R(z) has any substantial values. | 
The covariance will be assumed to drop off exponentially 
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lz| 
Re) = <p, > ry (38) 


where Ly is a correlation distance. Substituting (38) for R(z) into (37) 
and performing the integration for Io « L the average added loss is: 


Dal Le 
SES ae Kaci 


To evaluate (39) for the average added loss in the range of relative 
thickness 6 to be considered here, coupling to all TE, which are propa- 


(39) 


7 anit Tika 
Hy — 
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Fig. 10 — Coefficients em, of coupling between TE and TE», waves at lining 
imperfections of circular symmetry. b/A = 4.70, e = 2.5. 
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gating must be taken into account. Of the coupling to TM,, modes 
only lower-order modes with n < 4 need be taken into account. The 
contribution of coupling to TM,, modes with n => 4 to the average 
added TEq: loss is small enough to be neglected. 

Tor the coupling to TE,, modes, first-order approximations for phase 
constants and coupling coefficients may be used in case of higher-order 
modes with n = 4. The contributions from coupling to these higher- 
order TE,, modes to the average TEy, loss is small enough so that small 
errors in these approximations will cause no appreciable error in the 
final result. 

For a numerical evaluation of (39) the phase constants of Figs. 2, 3 
and 4 and coupling coefficients of ligs. 6, 7, 8 and 10 have been used. 
Yor higher-order TE,, modes the approximate expression (31) is ade- 
quate. Interaction between circular electric waves at lining imperfections 
with no circumferential dependence may be described for all modes by 
approximation (33). 

As a result of numerical evaluations, the diagrams in Figs. 11 and 12 
have been drawn. They show as a function of the correlation distance 
Lo the rms value of vp , the particular component of the thickness devia- 
tion, which by itself would increase the average TE; loss by 1 per cent 
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Fig. 11 — TEo, loss in lined waveguide with random lining imperfections of 
exponential covariance: 2-inch inside diameter, at 55.5 kme, lining ¢, = 2.5,6 = 
1 per cent. 
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Fig. 12 — TEo: loss in lined waveguide with random lining imperfections of 
exponential covariance: 2-inch inside diameter at 55.5 kme, lining e, = 2.5, 6 = 
0.5 per cent. 


in Fig. 11 and 0.1 per cent in Fig. 12 of its value in perfect copper wave- 
guide. aes 

All curves show a minimum. Among all the v, the strictest tolerance 
is imposed on vo by the interaction between circular electric waves. In 
this case for the critical correlation distance Ly) = 0.6” the rms of thick- 
ness deviations of a lining of 6 = 1 per cent should, according to Fig. 
11, be less than 6 per cent. In absolute values the 0.01” thick lining 
should be made uniform to +0.0006”. 

When the lining is thinner, the tolerances are considerably eased. Ac- 
cording to Fig. 12 and noting that the extra loss varies as the square of 
the thickness deviations, a 0.005” thick lining needs only to be made 
uniform to +0.0025” for the circular electric wave interaction to in- 
crease the average TE, loss not more than by 1 per cent. 

Mode interaction at lining imperfections not only increases the aver- 
age TE, loss, but also degrades the transmission characteristics which 
in perfect waveguide would be smooth functions of frequency. Signals 
transmitted through imperfect waveguide will suffer amplitude and 
phase distortion. 

A radio-frequency pulse of rectangular shape might be considered a 
standard signal. Mode conversion-reconversion effects will cause recon- 
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version forerunners to precede the output pulse and tails to follow it.’ 
If too large in amplitude, these forerunners and tails will in a PCM sys- 
tem produce errors in regeneration. 

The rms amplitude of the reconversion tail is largest after a long 
signal pulse. Immediately after this pulse, the contribution to the rms 
value by reconversion from one coupled mode is given by® 


/ FOP = Cyn? e(ABoen) / os (40) 


In (40) Aemn is the difference in attenuation constant between TEom 
and the coupled mode n. ¢ is the spectral distribution of », given in 
terms of the covariance by 





ole) = [Rie ate (41) 


For the exponential covariance of (38) the spectral distribution is 


4arlio 


ie + PLS’ (42) 


g(t) =<vy> 
The contributions to the reconversion tail from different modes are 
caused by components of the spectral distribution g(£) at the corre- 
sponding = A3,,, . From one coupled mode to the next these are quite 
different spectral components. Their contributions might therefore be 
assumed to be uncorrelated. Then the total contribution to the rms 
value is obtained from the sum of the squares of each single contribu- 
tion: 
L 


2 es OV (A ae 4 
| de | 2 Cnn g ( Brin) 2Adinn ( 3) 





The most critical signal distortion will undoubtedly be caused by 
circular electric wave interaction at lining imperfections of order p = 0. 
Not only does this interaction increase the average Tim loss most 
strongly, but higher circular electric waves are also propagating with 
extremely low loss. They will, therefore, contribute terms to the sum 
of (43) which are large because the denominator Aam, is small. Signal 
distortion due to circular electric wave interaction should therefore be 
considered first. 

2 

In Fig. 13 the quantity Lae has been plotted as a function of the 
correlation distance Ly . Mode interaction between TE, and TE con- 
tributes most strongly to the reconversion tail. Therefore the curve has 
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a maximum where the TEo,; — TE: term in (43) has its maximum value. 
Since in (42) the spectral distribution assumes its largest value when 


Les +, the terms in (48) will show their largest value when 


20 
ABmn 


that is, when the correlation distance is equal to the beat wavelength 
between the two coupled modes. In Fig. 13 the maximum occurs at the 
beat wavelength between TE and TE: . 

For this most critical spectral distribution of random imperfections 
the total reconversion tail is still quite small. For example, let +/72 = 
0.2 be the rms of thickness deviations; then for a waveguide length of 
L = 20 miles the rms of the reconversion tail amplitude is still more 
than 30 db down compared to the signal pulse. Tolerances on lining im- 
perfections, therefore, need not be as strict for signal distortion as 
they must be for the increase in average Tl; loss. In the present ex- 
ample the increase in average TE loss would be nearly 4 per cent. 

At higher frequencies an increase of mode conversion is indicated by 
the linear frequency dependence of coupling coefficients according to 


Le= 
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_ Fig. 13 — TE — pulse distortion in lined waveguide with random imperfec- 
tions of the lining. Mode conversion causes a reconversion tail of rms amplitude 


V la? to follow the pulse. b/\ = 4.70, e- = 2.5, 6 = 1 per cent. 


IMPERFECTIONS IN LINED WAVEGUIDE 1611 


(34), (35) and (36). In addition the beat wavelength between coupled 
modes increases linearly with frequency. The coupling coefficients will 
also be larger for higher permittivities of the lining. 

For operation at higher frequencies or with linings of larger per- 
mittivities, tolerances will have to be chosen correspondingly tighter. 


VI. GENERALIZED TELEGRAPHIST’S EQUATIONS FOR DEFORMED WAVEGUIDE 


Cross-sectional deformations of the lined waveguide will couple cir- 
cular electric waves to unwanted modes and like imperfections of the 
lining degrade the transmission by mode conversion and reconversion. 

To analyze wave propagation in deformed waveguide, a wall imped- 
ance representation will be used to formulate the boundary conditions 
at the surface of the lining.® In perfectly normal waveguide the bound- 
ary conditions of a perfectly uniform lining are 


E, = —Z.H, (44) 

E, = Z,Hz. (45) 

Small deformations cause the radius a of Fig. 14 to be a function of ¢, 
a = all + o(¢)). (46) 


Any function ¢o, being periodic in yg, may be expanded into a Fourier 
series: 


a = aol + Dc, COs gy]. (47) 
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Fig. 14 — Lined waveguide with cross-sectional deformation. 
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Sine terms have been omitted; they would only add terms of orthogonal 
polarization. The deformation is assumed to be small and smooth: 
de 
¢<1, ip <1 (48) 

For the deformed waveguide the boundary conditions will now have 
to be satisfied at r = a and not at the nominal radius ap . In (44) and 
(45) the tangential components of the magnetic field require the tan- 
gential electric field to have a certain value. The magnetic field may be 
considered to excite the electric field. In the deformed waveguide this 
excitation stays nearly the same, but due to the slight displacement of 
the wall it now occurs at r = a. 

The electric field at r = a can by Taylor series expansion be written 
in terms of the field at r = ao. Neglecting higher-order terms 


se 





E(a) = E(a) + (49) 

and the boundary conditions are: 
Bea) + SEG agg = — 2-H 6a) (50) 
Bylo) + BHA ayo + By(as) 2 = ZeHs(aa). (51) 


Maxwell’s equations for r < a are given by (6) to (11) with eg + 

= 1. Also, the representation of the transverse field components for 
ry < a in terms of normal modes of the perfect lined waveguide is 
as in (4). Substituting for the transverse field components from (4) into 
(11), the longitudinal electric field is obtained: 


2 
= jw DI, X* L ycs5 (52) 

n k? 
The longitudinal magnetic field, however, cannot be obtained from (8) 
since the series expression for /, in (4) is nonuniformly convergent and 
must not be substituted for differentiation into (8). We only substitute 
for E, from (4), multiply (8) by Tm’, and integrate over the cross section. 

After partial integration: 


20 


cdi | H,T,' dS = | EyTp'ady | (53) 
s 0 





27 , 
-zV.] Tr | Te | aa de + 2 Vad | to a8. 
0 roe 0 
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To find relations for the current and voltage coefficients, substitute 
(4) for the field components into Maxwell’s equations. Then add 
ot; lg ote 


rs + din 7 ap times (6) 


and 


OTs, fla. 4% 
Or + din We Tap times (7) 





and een over the nominal cross section. The result is 


ne a+ = m= | (grad E,) (grad T,) dS 


Tee. , 
+ dy I (grad H,) (flux Tr”) dS 
(54) 
— Joona 2 peo =| f (grad 7T,,) (grad 7',) dS 


Ine 
+ dq im i lerid: TV thie Pas. 


After partial integration on the cracls side of (54), 


dV mn jeer w= [eB B, (f= 4 dm Pom OT a 
dz or ve ie Op. we 


ae 27 
OT! : Os lig OL x, 
Seay ind dS — jeouo D In 7 72 lf ty (2m 4 de hn ee oY as de 





Sis 8 l TT! is]. (55) 


In special cases when the lining is very thin or when there is no lining, 
the individual terms for #, in (52) are zero for r = ay, while E, itself, 
because of the boundary condition (50), is different from zero. Then 
(50) is a nonuniformly convergent series, which describes H’, only in 
the open interval 0 < r < ay. Term-by-term differentiation will make 
the series diverge. Therefore the series had not been substituted for E, 
in (54). In (55) it may now be substituted in the integral over the cross 
section. In the line integral, H, from the boundary condition (50) may 
be substituted. Thus, instead of (55), 


Vm 4 ihm _ _g [O° cag | OF din tins OF 5 
dz mee : or | ar a k? dy 





Pe de (56) 
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Similarly, add 





/ 
— (te + dm oe times (9) 
or r dy 


/ 
_ (Of — 4, 2m 
r dy 


re) times (10) 


and integrate over the cross section. The result is 


27 1 / 
dis ieaveostn) ae (Me mag) a 
0 








rag or 
2 
bd / XH Tn DS + josey E Vndn > (57) 
Ss n v 
20 a] 
[a Fe ee at din dn tm) dy = din en Ta A | . 
0 rog or 
The boundary condition L,, = Z,H., of the perfect waveguide may be 
oT, oT,’ . ~6 
used for the normal mode term {—~ — d, ——~} in (53) and (57), and 
Trdg or 
(53) may be substituted into (57). The result is 
27 
om oF joeoV m —Joeododm Fr Xo ‘ee [E, i ZH] dg. (58) 





Introducing the boundary condition (51) for #, we get instead of (58) 
2 Qa 
Un + Jue.Vin = —Jweododm |, ae ce ous +2 a ae. (59) 
dz k? Jo 
Partially integrating the last term under the integral 


‘a 19! © Bade = —[ “| rp! + 1,2 | ap 


1 / 
and substituting for #, and £, from (4) and for E a —d, oe | from 
y ") 











the boundary condition in perfect waveguide, the other set of generalized 
telegraphist’s equations is obtained: 


Ken em 


WpLoAo 


ody — j2 Vath 








Aieo 2 : 
Tet ween = dare [ ne 
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: (60) 
7 . Ze ? , 
r a Pe Tm ody. 
0 WpLodo 





IMPERFECTIONS IN LINED WAVEGUIDE 1615 


The interest is limited here to propagation characteristics of circular 
electric waves. Therefore, only terms that describe direct interaction 
between circular electric and other waves need to be retained in (56) 
and (60). When V,, and J, are amplitudes of other modes, then £, , 
H, and £E, of circular electric waves are zero. Thus (56) and (60) reduce 
to 

ae 


"+ j-* In = 0 





(61) 





27 
tn, jem = — JE Vath Kahn (1 i.) [ ot,'Ta’de, 
dz AyTIN Ei AoW Lo 0 
Introducing traveling waves as in (16) and (17), the more convenient 
form (18) of generalized telegraphist’s equations is obtained. 
The coupling coefficients are 








c * = 5 Via dndm Bn im (1 - elf Tn Tm de (62) 
nm nllm ae Py J kao ‘ Tin Lm UP 
where z, = ~/e/uo Zp is the wall impedance with respect to free 
space. 
Substituting from (6.1) for 7, and 7,’ and from (47) for o(¢), 
er al : 
Camt = 5 Vhahin dn ae NaN halts hn) ep (: ida a), (63) 


A component op causes coupling only between circular electric waves 
and waves of circumferential order p. 

The wall impedance in (63) may for all cases of present interest be 
approximated by: 





tan kapv/e, — 1 4. (64) 


pe 1 

° WV —-]1 
For a very thin lining the expression (63) for the coupling coefficients 

reduces to the coefficient for coupling at the corresponding deformation 

in plain metallic waveguide. We obtain c,m = 0 for interaction be- 

tween TE o, and TM,, . For interaction between TE, and TE,, the 

coupling coefficients are 


Kmokno kno 
do V/2hmohno WV kno? ~— 2 
The numerical evaluation of the general expression (63) has been in- 


cluded in the program for automatic execution. Typical results have been 
plotted in Fig. 15 for a waveguide with a continuous axial offset (p = 1), 


C3 (65) 


Cloml[pn] = 
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Fig. 15 — Coefficients cmn of coupling between TE, and TEy, and TMi, in 
axially offset lined waveguide. b/A = 4.70, er = 2.5. 


in Tig. 16 for elliptical deformation, and in Fig. 17 for trifoil deforma- 
tion. 

In all three cases the coupling coefficients have the following charac- 
teristics in common: The coupling between TE, and TM waves is 
much weaker than the coupling between TE»; and TE waves even for a 
substantial thickness of the lining. For a particular deformation the 
coefficient of coupling between TE, and TE,, waves increases with the 
index n of the waves. Higher-order waves are coupled more strongly. 
The relative change of coupling between TE, and TE,, with the thick- 
ness of the lining is only slight in particular for coupling to higher-order 
waves. 

The most significant consequence of these general characteristics is 
that mode conversion effects due to coupling at waveguide deformations 
will be nearly the same in lined waveguide as they are in plain wave- 
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THICKNESS OF LINING IN PER CENT OF RADIUS 


Fig. 16 — Coefficients ¢mn of coupling between TEa and TE», and TMs, in 
lined waveguide with elliptical deformation. b/A = 4.70, 6 = 2.5. 


guide. Lined waveguide should therefore be manufactured to the same 
cross-sectional tolerances as is plain waveguide. 


VII. CONCLUSIONS 


TE; transmission in lined waveguide is degraded by thickness varia- 
tions of the lining and cross-sectional deformations. These imperfections 
couple the TE; wave to unwanted modes. The coupling to higher cir- 
cular electric waves at thickness variations of circular symmetry is 
strongest. Asymmetric thickness variations couple TE: to the corre- 
sponding asymmetric modes. 

Random thickness variations increase the average TE loss and cause 
signal distortion. The increase in loss is much more pronounced than 
signal distortion. In a typical example of random thickness variations of 
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Fig. 17 — Coefficients ¢mn of coupling between TEq and TE;, and TMs, in 
lined waveguide with trifoil deformation. b/\ = 4.70, er = 2.5. 


a 0.01” lining of « = 2.5 in a 2” I.D. waveguide at 55.5 kme, the rms of 
the symmetric component of this variation should remain smaller than 
0.002” for the average TEo, loss, not to be raised by more than 10 per 
cent of its value in perfect copper pipe. Under these conditions a pulse 
signal after traveling through 20 miles of this waveguide is distorted 
only by a reconversion tail nearly 30 db smaller than the signal. The 
signal distortion, being caused by circular electric wave interaction, can 
not, however, be reduced by ordinary mode filters. 

Cross-sectional deformations in lined waveguide cause nearly the 
same interaction of To with unwanted modes as corresponding defor- 
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mations in plain waveguide. Lined waveguide should be manufactured 
to the same cross-sectional tolerances as plain waveguide. 
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Characteristics of the Service Provided 


by Communications Satellites in 
Uncontrolled Orbits 


By J. D. RINEHART and M. F. ROBBINS 
(Manuscript received May 22, 1962) 


In a communications system which uses satellites in uncontrolled orbits, 
there would be times when the communication between two stations would 
be interrupted because no satellites would be in view. This paper provides 
material which can be used to describe the service which a random system 
of satellites would provide. An example system which could be used to pro- 
vide initial world-wide service is also discussed. This system has certain 
nonrandom characteristics, and the effect of these characteristics on the 
service to various parts of the world is examined by computer simulation. 
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I. INTRODUCTION 


One of the several types of systems which have been proposed for 
world-wide communications would use numerous satellites in circular 
orbits 3000-8000 miles* above the earth’s surface. The satellites would 
contain active repeaters through which any pair of stations on earth 
could communicate whenever a satellite is mutually visible to the sta- 
tions. In this system, the satellites would move independently of each 
other, and the motions of the satellites relative to the earth’s rotation 
would not be controlled. The manner in which the satellites are dis- 
tributed about the earth at a time chosen at random would be much 
the same as if the satellites were randomly placed into position at that 
time. 

Two ground stations will be able to communicate a large part of the 
time if a random system of satellites is used, but there will be times when 
no satellites will be mutually visible. This paper describes the service 
which satellites in random, or uncontrolled, orbits would provide. Serv- 
ice, in this context, refers to the availability of satellites for communica- 
tion and does not pertain to the type or subjective quality of communi- 
cation. Some of the material in this paper is an extension of a study 
reported earlier by Pierce and Kompfner.! 

Sections II and III describe the geometrical properties of satellite 
orbits and satellite visibility. Section IV presents some methods and 
results which have been used to describe the service provided by a 
random system. Section V discusses a particular system of satellites, 
which approximates a random system, and describes the service which 
would be furnished to points throughout the world. The nonrandom 


* The nautical mile is used throughout this paper. 
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characteristics which this system might have and the effect which these 
characteristics would have on the service are studied in Section VI. 


II. SATELLITE ORBITS 


This section describes some basic geometrical properties of satellite 


orbits. The discussions of service in later sections are based on the orbit 
properties defined here. 


2.1 Orbit Altitude 


The general orbit of an earth satellite is an ellipse which has the 
earth centered at one focus. Two quantities which are popularly used 
to describe the altitude of the elliptical orbit are the height of perigee 
and the height of apogee. As shown in Fig. 1, the perigee height is the 
distance of closest approach to the earth’s surface, and the apogee height 
is the maximum altitude reached by the satellite. The circular orbit, 
which is of prime interest here, is a special case in which the apogee and 


perigee heights are equal. The circular-orbit altitude will be designated 
by H. 
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Fig. 1 — Elliptical and circular orbits. 
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2.2 Satellite Period 


The period of an earth satellite, i.e., the time required for a satellite 
to move through 360° in its orbit, is given by 


Poa? a (1) 
| TA MG 

where M is the mass of the earth, G is the universal gravitational con- 

stant, and a is the semimajor axis of the orbit. The period is plotted as 

a function of H in Fig. 2. 


2.3 Orientation of the Orbit Relative to the Earth 


A satellite in an equatorial orbit moves in a plane which contains 
the earth’s equator, and the plane of a polar orbit contains the earth’s 
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Fig. 2 — Satellite period vs circular orbit altitude. The period of elliptical 
orbits is found by using a circular orbit altitude which is the average of the apogee 
height and the perigee height of the elliptical orbit. 
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rotation axis. The orientation of other orbits relative to the earth is 
defined by the inclination angle between the equatorial plane and the 
orbit plane. 

As shown in [ig. 3, the inclination angle is measured counter-clock- 
wise from the equatorial plane to the orbit plane. The direction of the 
motion of the satellite is important, so the measurement is specified at 
the point at which the satellite crosses the equator from south to north 
(the ascending node). Satellites in orbits with inclinations less than 90° 
move in the direction of the earth’s rotation. These orbits are easier to 
achieve than retrograde orbits (inclination larger than 90°) because of 
the inertial velocity added by the rotation of the earth. 

The plane which contains a satellite orbit remains fixed in space 
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Fig. 8 — Polar, inclined, and equatorial orbits. The orientation of the orbit 
plane relative to the earth is given by the inclination angle. 
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while the earth rotates under the orbit.* The orientation with the earth’s 
axis is constant, but the orientation of the orbit relative to a point on 
the earth continually changes. 


2.4 Relative Orientation of Orbit Planes and Relative Positions of Satellites 


The orientation of a system of several orbit, planes is not adequately 
described by the inclination angle, for the planes can have any relative 
orientation around the earth’s axis. This orientation will be specified by 
giving the angular position of the ascending nodes. The angle is meas- 
ured in the equatorial plane from an arbitrary reference. Thus, if the 
ascending nodes of three polar orbits are at 0°, 120°, and 240°, the 
planes are uniformly spaced. 

The position of a satellite in its orbit at a particular time will be 
given by a phase angle. This phase is the angular separation between 
the ascending node and the satellite. The angle is measured at the center 
of the orbit and is positive in the direction of satellite motion. A satel- 
lite over the north pole would thus have a phase of 90°. 

The relative positions of several satellites can be completely described 
by giving an altitude, inclination, relative ascending node, and phase 
for each satellite. In a random system, all satellites usually have the 
same altitude, and one or several inclinations may be specified. At a 
time chosen at random, all ascending nodes and phases would be equally 
probable. In some systems one degree of randomness is removed by 
specifying the relative ascending nodes. Also, several satellites with the 
same inclination may have the same ascending node and would there- 
fore be in the same plane. 


Il. SATELLITE VISIBILITY 


Satellite communication can take place between two ground stations 
only if a satellite is visible simultaneously from both stations. Thus, 
mutual visibility conditions determine the service which a system of 
satellites provides to the stations. 


3.1 Visibility Geometry 


A satellite is considered usable for communication if its elevation 
angle above the theoretical horizon is larger than some specified a.f This 


* This is true only for polar and equatorial orbits. The effect of the small motion 
of inclined orbits will be taken up in Section VI. 

} For convenience in terminology, the satellite will be said to be visible only 
when this condition is met. 
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angle limitation is influenced by local terrain, performance of the radio 
system,! and considerations of interference with terrestrial systems.? A 
value of 7.5° will be used for a in the studies presented here, since this 
value has been used in most previous discussions. 

The angle a defines a cone of satellite visibility as shown in Fig. 4a. 
The orbit of a satellite at altitude H intersects this cone in the two 
points at which visibility begins and ends. If the orientation of the orbit 
relative to the ground station is arbitrary, the locus of all possible ex- 
tremes of visibility is a circle on the cone. Projecting these points radi- 
ally onto the earth results in a circle with angular-great-circle radius y 
as shown in lig. 4b, where 


Vv = cos” (gts cos «) —a (2) 
A station could see a satellite which is at an altitude H whenever it is 
over this circular region. Two stations can communicate only if their 
circles of visibility intersect and form a mutual visibility region. 

Some mutual visibility regions for 6000-mile orbits are shown in Fig. 





(b) 


Fig. 4 — Satellite visibility geometry: (a) Plane view of the cone of visibility 
for a ground station at A. a is the minimum usable elevation angle. (b) Circle of 
visibility. A satellite of altitude H would be more than a degrees above the horizon 
at the ground station whenever it is over this circular region centered on the 
ground station. 
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5. Because of the polar projection used, the parts of the regions which 
are south of the equator are not shown. Satellites in polar orbits could 
pass over any part of these regions, but those with lower inclinations 
would not move further north than the latitude corresponding to the 
inclination angle. Thus if the motion of the satellites is also considered, 
the area of mutual visibility would be reduced for lower-inclined orbits, 
and it would degenerate to a line for equatorial orbits. 

Fig. 5 shows a region in which a satellite in a polar orbit could be 





Fig. 5 — Mutual visibility regions for 6000-mile polar orbits. a = 7.5°. Portions 
of the visibility regions below the equator are not shown on this projection. 
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seen simultaneously by all three pairs of stations. This region and the 
other regions of overlap present a problem of competition for the use 
of satellites which are over these regions. A discussion of this problem 
is given in Section 4.1.3. 


3.2 The Probability that a Satellite is Mutually Visible 


The description of random-orbit service requires a knowledge of the 
probability that a satellite in a particular orbit is mutually visible to 
two stations at a randomly chosen time. A numerical method is used to 
obtain an estimate of this probability, which will be designated as p. 


3.2.1 Numerical Determination of p 


Fig. 5 can be used to determine p= for Seattle and Tokyo for 6000-mile 
polar orbits. The earth rotates under the orbit, so at a randomly-chosen 
time the orbit plane can intersect the earth along any meridian. The 
meridian lines in the figure thus represent a uniform sample of all orien- 
tations of the plane. All phases are equally likely for the satellite at the 
chosen time. For a particular orientation of the plane, the fraction of the 
corresponding meridian line which is contained in the mutual visibility 
region is the probability that the satellite can be seen. The average of 
the fractions determined for all meridians in the sample is an estimate 
of p. 

For an equatorial orbit, p is simply the fraction of the earth’s equator 
which is contained in the visibility region. The value of p for the general 
inclined orbit was determined by a computer program which uses a 
method similar to the one described for polar orbits. A method which 
uses numerical integration of an expression for =p is given in Ref. 3. 


3.2.2 Variations in p 


For a given orbit altitude and inclination, p is a function of the lati- 
tudes of the two ground stations and the distance between them. Tor 
pairs of points whose mutual visibility regions include one of the earth’s 
poles, polar orbits provide a higher p than do other orbits. Regions which 
are centered on the equator are best served by equatorial orbits. Inter- 
mediate inclinations give the highest p for regions which are between 
these extremes. 

For a given pair of ground stations and a given inclination, p increases 
with the altitude of the orbit. An example of the nature of this increase 
is shown in Fig. 6 which gives p vs altitude for Maine—London for 
polar orbits. The rate of increase of p with altitude begins to diminish 
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Fig. 6 — Fraction of time (p) a satellite in a circular polar orbit of altitude H 
is visible to Maine and London. a = 7.5°. 


rapidly at altitudes above about 8000 miles. The reason for this is ap- 
parent from (2). For high altitudes in this equation, the radius of the 
visibility region is approaching (7/2)—a@ which is the maximum value 
which can be attained, and the size of the visibility region directly in- 
fluences the value of p. The maximum value of p for this pair of points 
is 0.368. 

Equation (2) shows that the size of the visibility circle is also influ- 
enced by the value of a. Thus the minimum elevation angle affects the 
value of p for a given orbit and a given pair of ground stations. Fig. 7 
gives p vs a for Maine-London and 6000-mile polar orbits. The 
value of p is read from the left scale. If a is zero, i.e., if the satellite were 
used whenever it was above the theoretical horizon, p would be 0.28. 
An a of 20° would result in a p of about half of this maximum value. 
So if a were 20°, the satellite could be used during only half of the time 
it is visible. The use of an @ of 7.5° results in about 82 per cent utiliza- 
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Fig. 7 — The effect of the minimum elevation angle on the usefulness of a 
satellite in a 6000-mile polar orbit. Ground stations at Maine and London. 


tion of the visible time, so there is something to be gained in reducing 
this angle if it is possible. 


3.2.3 A Partial Table of p 


During the course of previous studies, values of p have been deter- 
mined numerically for many ground-station locations, orbit altitudes, 
and inclinations. These values are given in the table in the Appendix. 
The table is by no means complete, but many of the pairs of points 
which are of interest are included.* 

The first two columns in the table are the colatitudes of the ground 
stations. These numbers may be measured from the north pole or the 
south pole of the earth as long as the same convention is used for both 
points. The third column is the longitude separation of the two stations. 


* Ref. 4 gives values of p in a more general form. 
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The other columns are headed by altitudes and inclinations. Linear 
interpolation may be used between successive altitudes and successive 
inclinations with reasonable accuracy. 


IV. SERVICE PROVIDED BY SATELLITES IN RANDOM ORBITS 


4.1 Quality of Service, q 


A random-orbit system will use a multiplicity of satellites to provide 
service to many pairs of ground stations. For a particular pair of sta- 
tions, the quality of service is defined as the fraction of time the re- 
quired number of satellites is available for use. The symbol gq is used to 
denote the quality of service. 


4.1.1 Satellites in Orbits of the Same Altitude and Inclination 


(2) Single-Satellite Visibility 

Since p is the probability that a satellite in a certain orbit will be 
visible to a pair of ground stations at a time chosen at random, (1 — :p) 
is the probability that the satellite will not be visible at that time. If 
there are n satellites with identical p’s, and if the satellites have random 
ascending nodes and phases, the probability that none of these satellites 
would be useful at a random time is (1 — p)”. The fraction of time at least 
one satellite would be useful is then 

Ge = 1 Ws p)* (3) 

Vig. 8 gives values of q: for a range of values of n and p. If p is known, 
the number of satellites needed to furnish a certain quality of service 
may be found. For example, for stations in Maine and London, and 6000- 
mile polar satellites, p = 0.23. For a quality of service of 0.999, Tig. 8 
shows 27 satellites would be required. For gq: = 0.99, n = 18; and for 
qi = 0.90, n = 9. 

Equation (8) may also be written 


_ log (1 — q) 
log (1 — p) 
If n is changed to n’, gq: would change to qi’, and the ratio of n’/n is 
given by 


n’ _ log (1 — qr) 
n log (1 — qi) 

which is independent of p. This equation was used to derive a table (next 

page) which contains the number of satellites needed for various q: rel- 


ative to the number needed for qi = 0.999. 
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Fig. 8 — Quality of service for single satellite visibility, q.1, for a range of 


values of p and n. 
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(it) Multiple-Satellite Visibility 


Some pairs of ground stations may require more than one satellite at 
a time if the traffic between the two stations exceeds the capacity of one 
satellite. If m ground antennas were used at each station, m communica- 
tion paths could be established by way of separate satellites. The qual- 
ity of service on path m, which is the fraction of time m or more satel- 
lites are usable by the ground stations, is given by 


Gm = 1 —- pI O) pi(l — p)"* (4) 


= 


Figure 9 gives values of qm as a function* of q. For a value of qm = 
0.999 on the abscissa, the intersections of the ordinate line with the 
curves give gq: = 0.991, gz; = 0.964, etc. q1, gz, and q3 would be the 
qualities of service for three paths between Maine and London if the 
paths were operated on a priority basis, i.c., if path 3 operated only 
when three or more satellites were visible, path 2 only when two or 
more were visible, and path 1 when one or more was visible. Figs. 8 and 
9 may be used to determine the number of satellites needed for a par- 
ticular gm . Suppose q2 = 0.98 is desired. From Fig. 9, gq. = 0.9975. If 
we again use the example of p = 0.23, Vig. 8 shows 23 satellites would 
be needed. 

Multiple paths between two points may be operated with equal pri- 
ority simply by assigning periods of no service to the paths in turn. The 
lengths of these periods are not equal, but over a long period of time 
all paths would get equal qualities of service. Suppose there are three 
paths (m = 3). One of the paths would be out of service whenever only 
two satellites are visible, two would be out whenever only one satellite 
is visible, and all would be out when no satellites are visible. The frac- 
tions of these times for which a particular path would be out of service 
are 4, 3, and 1 respectively. The general fraction is (m — 7)/m, where m 
is the number of paths and 7 is the number of satellites visible (¢ < m). 
The average quality of service on each of m paths with equal priority is 


Gm = 1 — > ace (") p(l — p)”" (5) 


m 





* This relationship varies slightly with p. The curves in Fig. 9 result from using 
p = 0.18. The values of p which are of interest in this work range between 0.05 
and 0.30. The changes in the curves of Fig. 9 for these extreme values of p would 
be very small, so the curves can be used with reasonable accuracy for any-p in 
this range. 
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Fig. 9 — Fraction of time qg,, that m or more satellites are visible. In a system 
with multiple paths between two ground stations g,, is the quality of service on a 
path with priority m. The number of satellites needed to furnish gm is found by 
entering Fig. 8 with q: and p. 


Tig. 10 gives values g, as a function* of q.. For q: = 0.999, @2 = 
0.995, G3 = 0.985, etc. A previous example showed 27 satellites in 6000- 
mile polar orbits would provide q: = 0.999 between Maine and London. 
The above results show three equal-priority paths between these points 
would have qualities of service of 0.985. If 0.999 service were desired 
on each path, Fig. 10 shows qi = 0.99996, so 39 satellites would be 
needed (Fig. 8 with p = 0.23 and qi = 0.99996). This procedure can be 
used to determine the service from one point to several other points 
when the mutual visibility regions are almost identical. For example, 
stations in England, France, and Germany could be considered to be at 
about the same latitude and longitude, so 39 satellites would provide a 


* The previous comment on the effect of different values of p also applies here. 
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Fig. 10 — Quality of service Gm on each of m paths with equal priority. The 
number of satellites needed to furnish g,, is found by entering Fig. 8 with qi and p. 


qi of about 0.999 to each of these points with a separate satellite for 
each path. 


4.1.2 Mixed Orbits 


A previous section indicated that no single orbit inclination is best 
for all pairs of stations. This may be verified by examination of the 
table in the Appendix. Miami-Rio de Janeiro, for example, would have 
a quality of service of only 0.80 if the 27 polar satellites discussed before 
were used. Equatorial orbits are best for this pair of stations, but these 
orbits provide little service to Maine—-London. This implies that satellites 
should be placed in both kinds of orbits if both pairs of stations are to 
be served equally well. It is also possible that a single intermediate in- 
clination would be best. 

Suppose a system consists of a total of n satellites distributed in * 
different orbits. Let the jth orbit contain n; satellites, and let p; be the 
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fraction of time a satellite in this orbit is visible to a pair of stations. 
The quality of service for a single satellite is then 


k 
ga=1- I] (Lp) (6) 
7 
where n= >. 7; 


A solution of (6) for the above Miami-—Rio de Janeiro example shows 
that 22 satellites in 6000-mile equatorial orbits would need to be added 
to the polar satellites to make q, = 0.999. However, Maine—London also 
gets some service from the equatorial satellite, so less than 27 polar 
satellites are now required to give 0.999 service to this path. Thus the 
solution must be adjusted until it converges to the point where 0.999 
service results for both paths. 

The problem of mixed orbits is concerned with the specification of 
the number of satellites which should be placed in each of k different 
orbits in order to satisfy the usability requirements of several ground 
stations. If the satellite altitudes are the same, the best set of orbits 
might be considered to be the one which requires the smallest number 
of satellites.* The solution may be approximated by determining the 
number of satellites in each orbit individually and in sequence as in the 
above example. This method is difficult and time-consuming for complex 
problems, and a more direct approach is to use a linear-programming 
model. Such a model has been developed, but it is beyond the scope of 
this paper. 

A more general problem of mixed orbits involves the determination 
of the set of satellites, in orbits of all possible altitudes and inclinations, 
which will furnish required qualities of service for a minimum cost. The 
linear-programming model represents a large step toward the solution 
of this problem; however, other problems related to costs and functional 
relationships between other system parameters must be thoroughly in- 
vestigated before the problem can be solved. 


4.1.3 The Effect of Overlap of Mutual Visibility Regions 


In Fig. 5, nearly all of the mutual visibility region for Seattle-Tokyo 
is contained within the regions for Maine—London and Seattle-Hawaii. 
When a satellite is mutually visible to Seattle and Tokyo, this satellite 


* This is really an oversimplification. Depending on the location of the launch- 
ing sites, the payload which a given vehicle can put in orbit depends on the in- 
clination angle. With the possibility of launching several satellites with a single 
vee minimizing the number of satellites does not necessarily minimize the 
total cost. 
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may already be in use by one of the other pairs of points. The previous 
discussion on multiple-satellite visibility indicates the problem could 
be solved most of the time by using other satellites which are also visi- 
ble. The effect of this overlap on the qualities of service for each pair 
of points will be determined to illustrate the magnitude of the problem. 
Tor simplification, the overlap of the Seattle-Hawaii region with the 
Maine-London region will be ignored. 

The Seattle-Tokyo path would be out of service if either of the follow- 
ing three conditions prevails: 

(t) There is no satellite in the Seattle-Tokyo region. The fraction of 
time this would occur is given by (1 — pg-r)” where the sub- 
script identifies the region. 

(77) There is exactly one satellite in region A (the area common to 
the Maine—London and Seattle-Tokyo regions) and there are no 
satellites in the remainder of the Maine—London region and there 
are no satellites in the remainder of the Seattle-Tokyo region. 
The fraction of time the event occurs is 


[rpa (1 — pa)” If = (iar = pal Ul — (ps-r — pal” 
If all paths have equal priority, Seattle-Tokyo would get the 
satellite in one-half of the situations. Thus one-half of the above 
expression is the fraction of time the path is out of service be- 
cause of overlap with Maine—London. 

(1it) Same as condition (77) except region B (the area common to the 
Seattle-—Hawaii and Seattle-Tokyo regions) replaces region A 
and the Seattle-Hawaii region replaces the Maine—London re- 
gion. 

As a numerical example, a system of 27 satellites in 6000-mile polar 

orbits was considered. Table I summarizes the effect of the overlap. 

The effect of overlap is difficult to determine in a more complex 

situation; but because the effect is small, one can usually account for 
it by nominally reducing the quality of service or nominally increasing 
the number of satellites. In this example, the addition of one satellite 
would compensate for the overlap, i.e., g: for 28 satellites with overlap 
considered would be as good as q: for 27 satellites with overlap not in- 

















TABLE [ 
Path qi if Overlap is not qi including the Fraction of Service Lost 
Considered Effect of Overlap Because of Overlap 
Seattle-Tokyo 0.9876 0.9841 0.0035 
Seattle-Hawaili 0.9904 0.9874 0.0030 


Maine-London 0.9991 0.9987 0.0004 
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cluded. Complex overlap situations can better be evaluated by a com- 
puter simulation which has been developed to describe the service fur- 
nished by systems of satellites. 


4.2 Description of the Periods of Service and No-Service by Simulation 


Knowledge of the fraction of time a path is out of service is not suffi- 
cient to evaluate the service provided by a system of satellites. The de- 
scription of the length and frequency of these periods of no service is at 
least equally important. For a given quality of service, the no-service 
periods could be seconds long and occur frequently, or they could be 
days long and occur only occasionally. Certainly the way the no-serv- 
ice periods would be evaluated and administered would be different for 
these two extremes. 

To describe the periods of service and no service, a computer program 
which simulates a world-wide satellite communications system was 
used. The program assigns satellites to the several ground-station pairs 
in a system in a way which maximizes the service on a priority basis (the 
paths may have equal or ordered priority). The motions of the system 
may ke simulated for any length of time, and periods of several years 
may be studied with modest amounts of computer time. The periods of 
no service are recorded for each ground-station pair, and a statistical 
description of the service periods and periods of no service is provided. 
Both random and nonrandom systems can be studied. The results pre- 
sented in this section assume random ascending nodes and phases for 
the satellites. 


4.2.1 Description of Service for Various Qualities of Service 


Tig. 11 displays the periods of no service on a path from Maine to 
London for a system of 12 satellites in 6000-mile random polar orbits (a 
quality of service of 0.95). This figure covers a representative 30-day 
period from a one-year simulation. The darkened areas, which are 
drawn to scale, represent the times when the path is out of service be- 
cause no satellites are visible. The average length of the periods of no 
service (¢,) and the average time between these periods (¢;) are given 
for the one-year period. Fig. 12 presents the periods of no service for 
the Maine-London path if the 6000-mile polar orbit system has 18 
satellites (¢ = 0.99), and Fig. 13 gives the same information for a sys- 
tem of 27 satellites (¢ = 0.999). 

The communication system will probably need to be operated in such 
a way that periods of no service do not interrupt communication which 
is in progress. This would be accomplished by keeping new traffic off 
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Fig. 11 — Maine-London service from 12 satellites in 6000-mile polar orbits. 


the system during a “guard interval” which precedes the no-service 
period. The guard interval would be long enough to insure, with a 
reasonable probability, that all customers would be finished by the time 
the no-service period starts. 

With this method of operation, the no-service periods can be inter- 
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preted simply as delays in service to new customers. The periods are 
predictable weeks in advance, and they are short; therefore, they are not 
the same as catastrophic system failures such as cable breaks and re- 
peater failures.* As far as ordinary toll customers are concerned, a short 
no-service period is the same as a temporary saturation of the system 
by heavy traffic. Tt In view of the above comments, the no-service periods, 
which are popularly called ‘outages,’ might more appropriately be 
called “‘service delays.” 

The daily period of heavy traffic between most countries is only 
several hours long. The service delays which would occur during the 
busy period would probably be of primary importance, and those out- 
side this period would be secondary. As an example assume the busy 
period is four hours long. Then, for the 0.99 service shown in Fig. 12, 
there would be no days in which more than one service delay would 
occur during the busy period. The absolute time in the figure is arbi- 
trary, so the busy period could occur at any time. If all possible loca- 
tions of the busy period are considered, the 30-day period shown could 
have as many as eight or as few as zero service delays during busy 
periods. The maximum number for the 0.95 service shown in Fig. 11 
would be between six and twenty for that 30-day period, and the 0.999 
service of Fig. 13 would have between zero and two delays during busy 
periods. In comparing these three services, the numbers of satellites 
needed in each case should also be considered since the cost of satellites 
in orbit will be a large part of the total system cost. 


4.2.2 The Effect of Orbit Altitude on the Service 


Fig. 14 is a representative display of the delays which would occur on 
the Maine—London path if a system of 18 satellites in 3000-mile ran- 
dom polar orbits were used. The quality of service is 0.95, which is the 
same quality as that of Fig. 11. In the 3000-mile system the service 
delays would be more frequent but of shorter duration. These differences 
are caused by the shorter orbital period and shorter visibility times of 
the 3000-mile satellites. 

If the operational method discussed previously is used, a fixed guard 
interval would be added to each of the delays in the figures. The length 
of the guard interval is determined by lengths of te’ephone calls, so it 
would be the same for all cases. It is apparent that the effect of adding 

* There will also be repeater failures in satellites, but the effect of these fail- 
ures is only to reduce the quality of service. For example, if a system started 
with 27 satellites and 0.999 service (Fig. 18), failure of one-third of the satellites 
would only reduce the service to 0.99 (Fig. 12). 


+ The effect may be more severe for private line customers who require a circuit 
continuously. 
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Fig. 12 — Maine-London service from 18 satellites in 6000-mile polar orbits. 


the guard interval to each of the delays in Fig. 14 would be more severe 
than if it were added to the delays of Fig. 11. 

Although it may appear that for a given quality of service a high 
orbit would always give better service than a lower orbit, this is not 
strictly true. As the satellite periods and visibility times become longer, 
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Fig. 13 — Maine-London service from 27 satellites in 6000-mile polar orbits. 
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Fig. 14 — Maine-London service from 18 satellites in 3000-mile polar orbits. 
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the delays occur less frequently, but they also become longer. Even- 
tually a point would be reached where the delays in service would be 
excessive and would approach the severity of catastrophic failures. 


4.3 Analytical Description of Periods of Service and No-Service 


A number of persons have made analytical studies of the lengths of 
service delays (outages) and the time between these delays. Some of 
these expressions are included here for completeness. Additional results 
are given in a companion paper by S. O. Rice’ who makes use of a traf- 
fic model. The results predicted by the model compare favorably with 
simulated results described in the previous section. 

The average length of the service delays is given by 


= (1 — en (7) 


to 


n and p have been defined previously, and T' is the average time between 
successive appearances of a particular satellite. If the visibility region 
were circular and centered on the north pole, 7 for polar orbits would 
be equal to the orbital period P, which is given in Fig. 2. For polar or- 
bits, P is a good approximation for 7 for regions which include one of 
the earth’s poles such as the Maine—London region in Fig. 5. For other 
regions, such as Seattle—Hawaii, only a fraction of all possible orienta- 
tions of the orbit plane with the earth result in visibility, so there will 
be some times longer than P between successive appearances. An ex- 
pression for 7' for these cases has not been determined. For equatorial 
orbits, T would equal P if the earth were not rotating. For the kind of 
equatorial orbits assumed here, the satellite moves in the direction of 
the earth’s rotation, so the time between successive appearances above 
a point on the earth is longer than P. In this case, the expression for 7 
is 





an (8) 


1 34 


The average time between service delays is given by 





fetes ate (9) 
1— Mm 


Some useful approximations for the distribution of the lengths of delays 
and times between delays have also been given by Ref. 5. The fraction 
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of delays which would be longer than ¢ is approximately exp(—t/é.), and 
the fraction of delays which are separated by a time longer than ¢ is 
approximately exp(—t/t;). 

Equations (7) and (9) and the exponential approximations for the 
distributions have been compared with many simulated results, and the 
agreement testifies favorably for the usefulness of these expressions in 
describing random-orbit service analytically. 


V. AN EXAMPLE OF A WORLD-WIDE SYSTEM 


The material given in the previous sections discusses techniques which 
can be used to describe the service provided by a set of satellites. The 
application of these techniques is illustrated in this section by describing 
the world-wide service which would be provided by a particular system 
of satellites in 6000-mile orbits. This example system provides essen- 
tially uniform service throughout the world with a modest number of 
satellites. 


5.1 Description of the Orbits 


This system would consist of 24 satellites in 6000-mile circular orbits. 
It is assumed that four satellites would be placed into orbit with a 
single rocket, so each plane would contain four satellites. Three of these 
planes would be polar, and three would have an inclination of 28°.* 
The three planes in each group would be uniformly spaced (their ascend- 
ing nodes would be 120° apart) as shown in Fig. 15. The lines on the 
figure represent the intersections of the orbit planes with the surface of 
the earth at an arbitrary time. A discussion of the motion of the orbit 
planes and its effect on service is given in Section VI. A possible scheme 
for effecting the separation of the multiply-launched satellites is also 
discussed in Section VI. For the present, it is assumed that the 24 
satellites are randomly phased. 


5.2 Service from the United States 


Fig. 16 shows the qualities of service which would be provided by 
the system of 24 satellites if stations were placed in Maine, Seattle, 
Puerto Rico, and Guam. A point within the 0.99 contour would be able 


* Hquatorial orbits would be more useful in this system than 28° orbits; how- 
ever, a rocket launched from Cape Canaveral must use difficult, power-consuming 
maneuvers to achieve an equatorial orbit. It is shown in Ref. 6 that the Atlas 
Agena can place about 1300 pounds into a 6000-mile, 28° orbit but can place only 
about 200 pounds into an equatorial orbit of the same altitude. About 1000 pounds 
can be placed into a 6000-mile polar orbit from the Pacific Missile Range. 
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Fig. 15 — Orientation of the orbit planes of the 24-satellite system. The dotted . 
lines are the intersections of the polar planes with the surface of the earth, and 
the solid lines are the intersections of the 28° planes with the earth’s surface. 
The planes are numbered at the ascending nodes of the orbits. 


to communicate with one of these stations more than 0.99 of the time. 
A point north of the 0.95 contour would have service with one of the 
stations more than 0.95 of the time, etc. These contours were deter- 
mined by using (6) in Section 4.1.2 for values of p from the table in the 
Appendix. The location of the ground stations in the figure were chosen 
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in an attempt to reach as much of the world as possible from points in 
the United States and its outlying territories. 

In order to be useful, the stations in Puerto Rico and Guam would 
have to be connected to circuits in the United States. For Puerto Rico, 
this connection could be made with Maine by way of a satellite or with 
Florida by submarine cable. The length of the path through a single 
satellite link, i.e., the distance from one ground station through the 
satellite to another ground station, can be as much as 17,000 miles for 
certain positions of a satellite at a 6000-mile altitude. Thus a circuit 
from South America to Maine consisting of two satellite links in tandem 
could be as long as 34,000 miles. It is well known that the performance 
of telephone circuits is adversely affected by the time delay associated 
with long path lengths, and ideally the path should be as short as possi- 
ble. Thus for telephone communication with South America and Africa 
by way of Puerto Rico, the cable connection from Puerto Rico to Florida 
would probably be more desirable than a satellite connection with 
Maine; however, both alternatives may be desirable to provide diver- 
sity. 

Guam could be connected to the United States by using a satellite 
link from Guam to Hawaii and another satellite link from Hawaii to 
Seattle. Circuits from Asia to the United States would then consist. of 
three satellite links in tandem which could be as much as 50,000 miles 
long. A submarine cable which will connect Guam to the United States 
is presently planned for service in 1964. This cable would probably be 
a more attractive way of relaying telephone circuits to the United States | 
than would the two satellite links, but again it may be desirable to have 
both means available. 

The plan of Fig. 16 resulted from an attempt to reach as much of the 
world as possible through facilities under control of the United States. 
Most of the world could be reached with reasonably good qualities of - 
service, but the service to Africa is probably not adequate. A plan which 
results in better service to this area will be given in Section 5.4. 


5.3, Service from Europe 


Fig. 17 shows contours of constant qi from an arbitrary point in west- 
ern Europe. Much of the Pacific area cannot be reached from Europe 
by a single satellite link because the two regions are on opposite sides 
of the world. Africa could be served from Europe about as well as South 
America could be served from the U. 8., and Europe-South American 
service would be limited to about the same extent as U. S.—African serv- 





Fig. 17 — Qualities of service from a West-European point for a system of 24 satellites 
28°-inelined). 
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ice. Service to all of these areas could, of course, be improved by estab- 
lishing additional ground stations at appropriate points and using 
multiple satellite links in tandem. 

Other contours could be drawn with respect to other parts of the 
world; however, the contours in Figs. 16 and 17 are probably adequate 
to describe world-wide service. These contours can be moved in longi- 
tude or they can be inverted and centered at the corresponding latitude 
in the southern hemisphere. The next section presents a plan in which 
consolidation of traffic could be used to provide more efficient communi- 
cation between all parts of the world. 


5.4 A Possible Method of Operation for World-Wide Service 


North America has an extensive land-line network which would en- 
able all points to communicate with overseas points through one of the 
assumed ground stations in Fig. 16. Europe also has an extensive net- 
work, but land lines between countries on other continents are not as 
well developed. Each of these countries would need a direct connection, 
by way of a satellite, with a network through which other countries of 
interest could be reached. 

A possible scheme for world-wide communication is shown in Fig. 18. 
In this plan, countries within the encircled regions would communicate 
with stations in western Europe, Guam, Seattle, Maine, or Miami.* 
Since these stations would be interconnected by submarine cables and 
land lines, any of the encircled regions could be reached from any of 
these stations by using cable facilities and one satellite link, e.g., U. S.— 
African circuits could go by cable to Europe and then by satellite to 
Africa. Any two encircled regions could communicate by using two 
satellite links which would be connected by cable facilities, e.g., circuits 
from Asia to South America could go by satellite to Guam, then by 
cable and land lines to Miami, and then by satellite to South America. 

The ranges in quality of service shown for each path in the figure 
assume that all countries within the encircled region could use the same 
satellite. The South American service will be discussed to point out 
some implications of this kind of operation. In Fig. 18, stations A, B, 
and C in South America would always use the satellite which is mu- 
tually visible to Miami and station A. If no satellite satisfied this con- 
dition, station A would be out of service, and a satellite which was 

* This plan does not require direct communication by satellite between the 
U.S. and Africa, and the service to South America from a station in Puerto Rico 


would not be greatly different than the service from Miami would be. For this 
reason, Miami was chosen as the U. S. station for this part of the world. 
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mutually visible to Miami and B would be used, etc. Thus stations close 
to Miami would have high qualities of service and those further from 
Miami would have lower qualities of service. As shown on the figure, 
the range of gq from Miami to countries in South America is 0.90-0.99. 

Communication between countries in South America could also be 
accomplished by way of the satellite which is used to reach Miami. The 
quality of service between two countries would be approximately the 
product of the qualities which the individual countries would have with 
Miami. The service between these countries could be improved if they 
also used mutually visible satellites which were not visible to Miami, 
but this would require each country to have another ground antenna. 

According to Fig. 18, South America and Africa would communicate 
over a long network of land lines, cables, and two satellite links, but it 
is apparent that direct communication by way of one satellite link be- 
tween these regions would be possible. However, this direct communica- 
tion would require all South American countries and all African coun- 
tries to have another ground antenna specifically for this purpose. This 
would not be true if each of these areas had a land-line network which 
interconnected all countries. Direct communication between Europe 
and South America could also be achieved if each South American coun- 
try had still another ground antenna. The discussion of South American 
service can be extended to the other regions which are encircled in Fig. 
18. 

The method of operation illustrated in Fig. 18 minimizes the number 
of ground antennas needed by individual countries by consolidating 
traffic through central points which are interconnected by terrestrial 
facilities. With this method, routes which are frequently used could be 
served by single-satellite links, and other routes which are used less fre- 
quently could be served by two satellite links in tandem. 


5.5 Alternate Routing by Way of Two Satellite Links 


During the times when no satellites are mutually visible to a pair of 
stations, each station will often be able to see a satellite which is out of 
view of the other station. If there were a relay point which could see 
both of these satellites, a path with two satellite links could be estab- 
lished by way of the relay station. For example, Anchorage, Alaska might 
be a useful relay point for Maine and Europe. Iigs. 16 and 17 give quali- 
ties of service of about 0.95 and 0.90 for Maine—Anchorage and Anchor- 
age—Europe respectively. Another location which is well-situated for 
relays between Maine and Europe is the Cape Verde Islands off the 
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west coast of Africa. This relay point could also provide an alternate 
route for Miami-South America and Europe—Africa. 

The 24-satellite system was simulated for a four-month period in 
order to obtain an appreciation for the usefulness of the alternate-rout- 
ing concept. In this simulation the Maine—Kurope path had 98 periods 
during which a satellite was not mutually visible. During 53 of these 
periods, satellites were mutually visible between Maine and Anchorage 
and between Anchorage and Europe, so an alternate route could have 
heen established. A relay in the Cape Verde Islands would have been 
useful during 31 of the remaining 45 periods. Thus about 86 per cent of 
the service delays during this simulated period could have been avoided 
by using these two relay points. Of course some penalty would remain, 
during these periods, in the form of degraded communication perform- 
ance, but it is possible that this degradation could be tolerated for the 
short and infrequent intervals which would be involved. 


5.6 System Growth and Service Improvement 


It is expected that over a period of years an increase in demand and 
a desire to improve qualities of service would require the expansion of 
an initial system. The orbits in which satellites are added to the system 
would depend on the global distribution of the increase in demand for 
service. It is likely that polar satellites would be added at a faster rate 
than would 28° satellites, and if this were the case, the shapes of the 
contour lines would change from those given in Figs. 16 and 17. The 
service improvement offered by increases in the number of satellites 
will be illustrated by a simple example in which the satellites in each 
orbit are increased by equal numbers. 

As noted in Section 5.2, the contours of Figs. 16 and 17 represent 
qualities of service for the 24-satellite system determined by the equa- 
tion 


1—qm= (lL — py)? (l= pa)? 


If the number of satellites in each of the two orbit inclinations is changed 
to n, it follows that 


d—q@’) =(—q)””, 


where qi’ is the new quality of service for a contour line. Therefore, 
these contours remain valid, with the appropriate changes in qi, for 
different numbers of satellites in this type of system. The following 
table presents the values of gq: for each of the contour lines for several n. 
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n= 12 16 20 A 
Inner contour = 0.99 0.998 0.9995 0.9999 
Second contour = 0.95 0.982 0.9934 0.9975 
Third contour = 0.90 0.954 0.978 0.99 
Outer contour = 0.75 0.842 0.901 0.937 


An increase in gq: which results from an expanded system may be con- 
sidered solely as an improvement in service quality. However, this in- 
crease also affects the number of paths which can be made available 
between two points. I’or example, from the above table, for n = 20, 
gq. = 0.9995 for the inner contour. Inspection of Tig. 9 shows that 2 
paths could be established with ordered priority with q2 = 0.995; simi- 
larly, for three paths, g3 = 0.978 etc. From Fig. 10 similar information 
may be obtained for paths with equal priority. For the above example 
Gz = 0.9973, G3 = 0.9915, ete. 

The communication scheme of Fig. 18 is also valid for an expanded 
system of an equal number of satellites in 28°-inclined and polar orbits. 
The qualities of service indicated would be changed by replacing them 
by their corresponding values in the above table. 


VI. SOME NONRANDOM CHARACTERISTICS AND THEIR EFFECT ON SERVICE 


The system described in the previous section would be nonrandom in 
two respects. In the first place, the four satellites in each plane are not 
controlled in phase, so these satellites could bunch together. If this 
should occur, the four satellites would be of little more use than a single 
satellite. Secondly, the orbit planes will rotate slowly about the earth’s 
axis at different rates, so the three orbit: planes in each inclination could 
coincide after a long period of time. Certain parts of the world could 
then see the satellites only during parts of the day. This section dis- 
cusses these two nonrandom characteristics and presents simulation 
results which show their effect on service. 


6.1 Itelative Satellite Velocities and Phasing of Successive Launches 


If several satellites were placed into the same orbit plane, the most 
uniform coverage would result if the satellites were uniformly spaced. 
However, even if uniform spacing could be achieved initially, it is not 
possible to give all satellites exactly the same velocity. Thus the satel- 
lites would drift from the desired relative positions, and to preserve the 
uniform spacing, a propulsion system would be needed on each of the 
satellites. This would probably represent a severe requirement on a 
system which must have an appreciable lifetime. 

Iig. 19 illustrates one method of separating four satellites in such a 
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Fig. 19 — Geometry of satellite separation: (a) Incremental velocities relative 
to satellite *¥1. (b) Relative positions after 3 months in orbit. 


way that the effect of future bunching of the satellites would be small. 
In Fig. 19(a) the satellites initially would be moving counter-clockwise 
with the same velocities (they would be placed into orbit as one pack- 
age). Then the satellites would be separated by springs in order to give 
three of them the indicated velocities relative to satellite number 1. 
T‘igure 19(b) shows the relative directions of motion and the relative 
positions after three months in orbit. A satellite which is given an in- 
crease in velocity goes into a slightly higher elliptical orbit with a 
slightly longer period, and its angular velocity therefore decreases. This 
is the reason satellites 2 and 3 fall behind satellite 1 even though they 
have higher velocities. The reverse is true of satellite 4. 

The relative velocities were determined by a computer simulation. 
Many combinations of velocities (all small enough to be within the 
capability of springs and large enough to effect early separation) were 
tried, and the relative positions of the satellites were followed for a 
period of ten years. The set shown in Tig. 19 is the one which resulted 
in the most uniform coverage over the study period; however, errors of 
+3 per cent could be tolerated in each of these velocities without sig- 
nificantly changing the coverage. 

The three planes of satellites in each group would no doubt be 
launched a month or so apart because of preparation time and availa- 
bility of launch facilities. There is also a coverage advantage to be 
gained from phasing the launchings. Some pairs of stations could see 
all polar orbits at all times, and other pairs could see all 28° orbits at 
all times. If the satellites in each of the planes were given the set of 
velocities shown in Fig. 19 at different times (different launch dates), 
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the satellites in one plane could be made to be well-spaced at times when 
the satellites in another plane were bunched. 

The desired intervals between launchings were also determined by 
computer simulation. The first set of satellites was allowed to disperse 
for one month, and then many combinations of second and third launch- 
ing times over a six-month period were tried. or each combination, the 
coverage was examined for a ten-year period. The best' combination 
which was found specified launching the second plane 1.5 months after 
the first, and the third plane 4.5 months after the first. If either attempt 
were to fail, there would be other times several weeks later which would 
serve equally well. The two groups of planes would be handled separately 
from different launching points, and no phasing between groups would 
be attempted. 

The above procedures are not necessarily optimum, but they do repre- 
sent a way of achieving orbits which do not result in severe bunching of 
the satellites within the planes. 


6.2 Motion of the Orbit Planes 


The plane of the orbit of an earth satellite will not in general maintain 
a fixed orientation in inertial space. Instead, there will be a slow rotation 
of the plane about the earth’s axis. This rotation is caused by the non- 
spherical shape of the earth, and for circular orbits the rate of rotation, 
in radians per orbit, is given by’ 


2 
where 

R = radius of the earth 

H = altitude of the satellite above the earth 

J = 1.625 X 10°”, the first oblateness coefficient? 

t = inclination of the orbit plane. 


The direction of the rotation of the plane is opposite to that of the 
earth’s rotation for inclinations less than 90°, and is in the same direc- 
tion as the earth’s rotation for inclinations greater than 90°. The rota- 
tion vanishes for true polar orbits (cos 7 = 0), and a 6000-mile, 28° 
orbit rotates at a rate of about 100° per year. 

In the system shown in Fig. 15, the three orbit planes in each group 
would have no relative motion if each plane has the same inclination. 
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Small errors in inclination can be expected, however, and the resultant 
relative motions could cause the planes to bunch together after a num- 
ber of years. The magnitudes and directions of the inclination errors 
which are made will determine the extent of the bunching and the time 
of its occurrence. 

If it is assumed that the accuracy of the inclination angle will be 
within 5°, which is probably a pessimistic assumption, the most severe 
bunching of the near-polar planes would result for inclinations of 85°, 
90°, and 95° respectively for planes 1, 2, and 3 in Tig. 15. With these 
inclinations, plane 2 would not rotate, plane 1 would rotate clockwise at 
a rate of about 10° per year, and plane 3 would rotate counter-clockwise, 
also at a rate of 10° per year. Thus the three planes would be nearly 
coincident* after about 6 years. 

The worst combination of inclination angles for planes 4, 5, and 6 
would be 33°, 28°, and 23° respectively. All of these orbits would rotate 
in a clockwise direction, but plane 4 would rotate about 5° per year 
less than plane 5, and plane 6 would rotate about 5° per year more than 
plane 5. The planes would be closest to coincidence when their ascend- 
ing nodes were at the same position. Since the ascending nodes would 
be 120° apart initially, nearest-coincidence of the three planes would 
occur after 24 years. 

The bunched system which will be discussed here is shown in Fig. 20. 
The orbits have the inclinations discussed in the preceding paragraphs, 
and the figure represents the orientation of the planes six years after 
the satellites are placed into orbit. Note that the assumed inclinations 
represent a rather pessimistic example since 5° errors would need to be 
made in exactly the right combination. The maximum error may also 
be pessimistic, and a maximum error of one-half this value would delay 
this bunching until twelve years after the system was started. 

Even if the original system bunched together in this way after six 
years, it is quite likely the effect would not be severe if satellite failures 
and system growth are considered. The logical way to replace satellites 
which fail would be to establish a new plane with four satellites in it 
rather than try to replace individuals in the original planes.t The new 
plane could be placed in a gap which was caused by bunching. It is 
quite likely that more than 24 satellites would be needed by this time 
to improve the quality of service and to establish multiple paths be- 
tween some stations. These additional satellites would provide another 
way to fill gaps caused by bunching of planes. If one considers such 


* The planes would not be exactly coincident because they would have different 
inclinations. 
+ Assuming one rocket will place four satellites into orbit. 
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Fig. 20 — Orientation of the orbit planes of the 24-satellite system for a severe 
case of plane bunching. This bunching would occur after six years if a certain 
combination of 5° errors were made in the inclinations of four of the planes. 


things as satellites failing at random, new satellites being added with 
random errors in inclination, and motions of orbit planes in different 


directions and at different rates, it is not difficult to postulate the evolu- 
tion of a random system after a number of years. 


6.3 A Study of the Effect of Plane Bunching 


This section presents simulation results for three different systems of 
satellites. The first of these is the uniformly spaced system represented 


1660 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1962 


by Fig. 15, the second is the bunched system of Fig. 20, and the third 
is this same bunched system with four satellites added in another 28° 
plane (the ascending node of the plane is 180° from that of plane 5). 
The services each of these systems would furnish to Maine—London, 
Seattle-Hawaii, and London—Johannesburg were simulated for six- 
month periods. 

In the uniform system, the phases of the satellites were specified by 
using the relative velocities and phasings of successive launches dis- 
cussed previously. The bunched system was described by advancing 
these phases through six years and arranging the ascending nodes and 
inclinations to correspond to those of Fig. 20. 


6.3.1 Maine—London Service 


Fig. 21 shows ten-day portions of the simulated services of the three 
systems for the Maine-London path. Fig. 21(a) is a representative 
ten-day period for the uniform system. The statistics qi, t , and ¢; (see 
Section IV) are given for the ten days and for the six months to provide 
a means of comparing the services quantitatively. It can be seen from 
these statistics that the ten-day period shown for the uniform system 
has longer periods of no service and longer times between these periods 
than the averages for the six months. 

Fig. 21(b) shows the poorest service which resulted for the bunched 
system. The poorest ten days were chosen so that the effect of bunching 
of satellites within the planes would also be included. Although this 
group of days cannot be compared directly with the first group, the 
six-month statistics can be compared. For the six-month periods, q: is 
0.984 for both systems, and the periods of no service are longer but less 
frequent for the bunched system than for the uniform system. The ten- 
day statistics for the two systems differ by similar amounts from the 
six-month statistics of the uniform system. From these comparisons, it 
is difficult to say that one system is better than the other for this path. 

It is not surprising that this path would be only slightly affected by 
the bunching because most of the service is furnished by the near-polar 
satellites and Maine and London always have mutual visibility of these 
three orbits as shown in Fig. 5. Because the earth rotates under the or- 
bits, there would be two times (12 hours apart) each day when the por- 
tions of these orbits which could be seen by Maine and London would 
be a minimum. The periods of no service would most likely occur during 
these times, and a 12-hour pattern in these periods would result. This 
pattern 1s evident in Tig. 21(b). 


HOURS 
0123 4 5 6 7 8 9 10 If 12 13 14 15 16 17 18 19 20 21 22 2324 


6-MONTH PERIOD 10-DAY PERIOD 
q,= 0.984 Q, = 0.988 
(a) to =14.7 MIN to= 17.4 MIN 
tL =15.4HRS ti= 23.7 HRS 


6-MONTH PERIOD {0-DAY PERIOD 
q,=0.984 q., = 0.978 

(b) to= 18.4 MIN to =16.4 MIN 

tj = 19.0 HRS t; =12.4 HRS 


6-MONTH PERIOD 10-DAY PERIOD 
q,=0.992 q,= 0.992 
(c) to= 15.1MIN to= 7.9 MIN 
t,= 29.8HRS t, = 18.3 HRS 


Fig. 21 — Maine-London service: (a) Representative service furnished by the 
uniformly spaced system shown in Fig. 15. (b) Service during the poorest 10-day 
period from the 6-month simulation of the bunched system shown in Fig. 20. (c) 
Service during the period shown in Fig. 21(b) if four satellites in a 28° plane were 
added to the bunched system. 
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It will be shown later that the other two paths in the study would be 
more seriously affected by the bunching, and the third system with 28 
satellites will be studied as a means of relieving the effect of the bunch- 
ing. This system was also simulated for the Maine—-London path for 
completeness, and the results are given in Fig. 21(c). The same ten-day 
period is shown for Figs. 21(b) and 21(c), so a direct comparison can be 
made. It is clear from this comparison that the four additional satellites 
more than compensate for the possible, small effect of bunching on this 
path. 


6.3.2 Sealtle-Hawait Service 


Fig. 22 gives similar ten-day displays for the Seattle—Hawaii path. 
Comparison of the six-month statistics for the first two systems shows 
the service to be affected considerably by the plane bunching. In Figure 
22(b), the periods of no service tend to occur during the same part of 
each day, and this again is caused by the rotation of the earth under the 
orbits. Consideration of Fig. 20 and the visibility region of Fig. 5 
shows there would be a part of each day for which only orbit plane 6 
would be over the visibility region and it is during this time that the no- 
service periods would most likely occur. Twelve hours later orbits 4 and 
5 are over the visibility region but orbits 1, 2, and 3 are not; since the 
service at this time is still rather good, it is apparent that the effect. of 
the bunching of the polar orbits is not large. 

These characteristics suggest the addition of another 28° plane with 
an ascending node 180° from that of plane 5, and the service which would 
result from this system of 28 satellites is shown in Fig. 22(c). In this 
figure the no-service periods are considerably reduced from those of 
Fig. 22(b), and they occur during the two parts of the day when the 
polar orbits are not visible. Comparison of the six-month statistics for 
the 28-satellite system with those of the uniform 24-satellite system sug- 
gests that both systems would provide about the same service to the 
Seattle-Hawaii path. It might be said, then, that the four added satel- 
lites represent the penalty of the bunching for this path. 


6.3.3 London-J ohannesburg Service 


The service to the London—Johannesburg path is represented in Fig. 
23. Comparison of the first two groups shows that a severe pattern in 
the periods of no service is caused by the bunching of the planes, and 
comparison of the six-month statistics shows a definite degradation in 
the service to this path. The nature of the reduction in service is the 


HOURS 
0123 4 5 6 7°8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 2324 


6-MONTH PERIOD 10-DAY PERIOD 
d= 0.987 q, = 0.989 
(a) to = 21.7 MIN to= 19.3 MIN 


ti = 26.5 HRS ti= 29.7 HRS 


6-MONTH PERIOD {0-DAY PERIOD 

q,= 0.965 1 =0.955 
(b) to= 26.7MIN to = 28.1 MIN 
t= 12.3 HRS tj = 10.0 HRS 


6-MONTH PERIOD 10-DAY PERIOD 

Q,= 0.987 q,=0.986 
(c) to= 18.1 MIN to= 13.0MIN 
t.= 22.9HRS t, = 15.8HRS 


Fig. 22 — Seattle-Hawaii service: (a) Representative service furnished by the 
uniformly spaced system shown in Fig. 15. (b) Service during the poorest 10-day 
period from the 6-month simulation of the bunched system shown in Fig. 20. (c) 
Service during the period shown in Fig. 22(b) if four satellites in a 28° plane were 
added to the bunched system. 
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HOURS 
0123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 at 22 23 24 


6-MONTH PERIOD {0-DAY PERIOD 
q, = 0.88 q,=0.88 

(a) to = 26.4 MIN to= 26.1 MIN 
ti =3.2HRS t= 3.2 HRS 


6-MONTH PERIOD 


{0-DAY PERIOD 


q,= 0.835 q, = 0.791 
(b) to= 34.0 MIN to = 38.6MIN 
t, = 2.9 HRS t; = 2.4HRS 


6-MONTH PERIOD 


10-DAY PERIOD 


q,= 0.925 q.,= 0.889 
(c) to=23.0MIN to= 25.0 MIN 
t,=4.8HRS t = 3.3 HRS 


Fig. 23 — London-Johannesburg service: (a) Representative service furnished 
by the uniformly spaced system shown in Fig. 15. (b) Service during the poorest 
10-day period from the 6-month simulation of the bunched system shown in Fig. 
20. (¢) Service during the period shown in Fig. 23(b) if four satellites in a 28° 
plane were added to the bunched system. 
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same as it was for the Seattle-—Hawaii path, and Fig. 23(c) shows that 
the four additional satellites would be more than adequate to overcome 
the effect. This statement is best supported by comparing the six-month 
statistics for the three systems. 


6.3.4 Summary 


The service furnished to Maine—London by the bunched system of 
Fig. 20 would be essentially the same as the service furnished by the 
uniform system. The services furnished to the other two paths would 
be degraded by the assumed bunching of planes, but this degradation 
could be compensated by adding four satellites in a 28° orbit. Two of 
the paths would have better service from this bunched system of 28 
satellites than they would have from the uniform system of 24 satellites. 
Thus the penalty of the bunching is less than four satellites. 

The assumed bunching would result after six years if large (5°) errors 
were made in the inclinations in exactly the right combination. A reduc- 
tion of the error by a factor of two would double the time required for 
bunching. Also, if errors in the range of 0°-5° were made on a random 
basis, the bunching after six years would not be as severe as that which 
was assumed. 

Satellite failures and system growth were not included in the study, 
but they represent important factors in the control of bunching. If four 
satellites were to fail from each group of twelve during the six years, 
their replacements could easily fill in the gaps caused by the bunching. 
These gaps could also be filled by satellites which might be added to 
increase the qualities of service from their initial values. Because the 
bunching is slow, the constant redistribution resulting from replacements 
and additions would be expected to make the bunching of planes of 
minor significance, and through these processes a random system would 
no doubt evolve. 


VII. CONCLUSION 


The service which would be provided by satellites in uncontrolled 
orbits has been described analytically and by simulation. Section IV 
provides equations and graphical and tabular material which can be 
used to describe the service of systems which are random, and the same 
material can be used to estimate the service which would be provided by 
systems which have certain nonrandom characteristics. Results of com- 
puter simulations of several systems have been presented to develop an 
intuitive feeling for various qualities of service. 
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The example system which was studied shows that rather good world- 
wide service could be started with a modest number of satellites. The 
usefulness of existing facilities in extending this service and in providing 
for efficient consolidation of traffic has also been illustrated. A method of 
improving the continuity of service by using alternate relay points has 
been included. 

The nonrandom characteristics of the example system were investi- 
gated by computer simulation to determine their effect on service. A 
method of phasing multiply-launched satellites was presented, and the 
effect of bunching of orbit planes was studied. A severe case of bunching 
was shown to degrade the service to some representative pairs of sta- 
tions, but this degradation was shown to be more than compensated 
by a small increase in the number of satellites. Since the bunching of 
planes in ordered systems would occur at a slow rate, it is felt that re- 
placements of satellites which fail and the additions of satellites for 
growth would provide an adequate means of controlling the effect of 
this bunching. It is also felt that these replacements and the additions 
would result in the eventual evolution of a random system in which 
significant bunching would be less probable. 
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APPENDIX 


Table II presents values of p X 10% for numerous ground station 
locations, orbit altitudes and inclinations. The first three columns give 
the co-latitudes of the ground stations and their longitude separation 
in degrees. Because of symmetry with respect to the equator, two co- 
latitudes may be measured from either the south pole or the north pole. 
Some ground stations of interest are named next to their corresponding 
locations. Where one station is paired with several others, the first 
name is not repeated and the second is indented. 

The other columns are headed by H, the circular orbit altitude and 2, 
the inclination. Linear interpolation may be used between successive 
altitudes and inclinations with reasonable accuracy. 





Anchorage, Alaska~ 
Seattle 
Tokyo 


Lands End, England- 
Maine 
New Delhi 
Leopoldville 
Johannesburg 


Seattle- 


Tokyo 
Hawaii 


Maine- 


Rome 
Madrid 


Colat 1 


10 
20 


20 


28 
28 
30 
30 
30 


34 


39 
39 
39 
39 


40 


42 
42 
42 
42 


42 


45 
45 
45 
45 








Colat 2 


45 
45 


45 


42 
55 
45 
45 
45 


45 


45 
62 
94 
115 


76 


48 
55 
70 
71 


76 


45 
48 
49 
55 


Longitude 


100 
160 


28 
70 
130 
140 
180 


108 





65 
83 
20 
30 


76 
98 
33 
71 


93 


47 
83 





65 
100 


Difference} . 








TABLE II 


H = 2500 N. Mi 


075 
033 





000) 047 
026 


016 


000 
000) 013 
000} 089 





110 
059 


069 
040 


011 


027 
100 





70° 


149 
088 


094 
051 


009 


O41 
081 





90° 


184 
075 








124 
040 


009 


086 
038 
070 


000 


000 
000 
122 





060 








H = 4000 N. Mi 


115 
063 


078 
052 
123 
077) 053 





036 
127 


045 
077 





152 
093 


102 
069 


036 


053 
187 





© | 28°] 35°} 50°] 70° 


198 
130 


137 
090 


029 


076 
134 





90° 


239 
142 


154 


187 
082 


027 


160 
102 
110 


196 
142 
143 











030 


81 


000 
175 


116 





H = 6000 N. Mi 


| 28° | 35° 


088 
037 


000 


150 
090 
014 
007 
000 


060} 107 


161 
122) 094 





058 
158 


080 
108 
045 





H = 8000 N. Mi 


90° | 1.5 | 28°| 35°] 50° 


0 


186 
123 


133 
096 


062 


075 
166 


101 





242 
164 


185 
123 


050 


111 
i71 


134 


199 


266 
190 
189 
184 
176 


196 


230 
132 


046 


141 


200 
150 
148 
086 


060 


242 
165 
153 
150 


080 


188 


085 
033 
219 


152 


187 
151 


098 
122 





177 
114 


127 
095 


132 


074 
184 


217 
149 


154 
115 


085 


094 
193 











70° 


283 
192 


220 
152 


066 


148 
196 


90° 


304 
230 


222 


256 
180 


062 


231 
188 
179 


270 
170 
182 
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Colat 1 Colat 2 
Dakar 45 76 
45 80 
45 90 
45 90 
Rio de Janeiro 45 112 
45 120 
48 51 
48 58 
51 56 
Tokyo, Japan- 
Guam 55 76 
56 58 
56 69 
58 80 
Cairo, Egypt- 
Puerto Rico 60 71 
New Delhi, India- 
Sydney, Australia 62 123 
Miami, Florida- 
64 91 
Rio de Janeiro 64 112 
Santiago, Chile 64 123 
68 76 
Hawaii-~ 
San Francisco 70 53 
Manila 70 75 
Sydney 70 123 
Puerto Rico- 
Dakar 71 76 
71 89 


Longitude 


TaBLE I]—Continued 


Difference}. 


52 
100 


21 
40 


23 
41 
27 
98 


74 


37 


57 


32 
84 





53 


48 
63 






































H = 2500 N. Mi H = 4000 N. Mi H = 6000 N. Mi H = 8000 N. Mi 
*pe | 35° | 50°| 70°] 90°} Ge | 28°] 35°} 50°| 70° | 90°] ‘ge | 28° | 35°} S0°| 70°] 90° i =| 28° | 35°| 50°| 70° | 90° 
128 112 
041 046 
191 107 
200 080} 263 111} 305 131 
077| 033) 023) 018) 017] 138 077| 052) 041] 038) 188 119] 082} 063] 057] 225 159] 105] 082} 077 
138 054 
136 155| 218 237| 263 281) 302 317 
113 112) 184 171} 228 224| 282 295 
068 093] 136 154] 178 207 233 272 
193 235 165 277 
137 109] 197 166] 238 223] 286 291 
105 068] 160 109} 239 153| 227 180 
158 069| 209 105] 242 138] 266 164 
033 087 052 084 
001| 001) 001 068 023| 017| 014| 013} 100 060 040 032| 029| 126 091} 059| 046| 043 
219 232 140 298 
135] 055] 039] 031| 029) 181 108] 073} 058} 054] 214 153| 105] 084) 078) 241 198} 134| 103] 097 
133 191 077 234 
118 155 103 178 
129] 104] 107} 089| 076] 181 146| 146] 143] 118} 216 182} 178] 178] 164] 240 209} 203] 206] 189 
077| 068 
081] 028] 020} 016) 016] 128 077| 048) 039) 036) 166 121] 079] 061] 056) 200 154] 105] 080) 074 
140 180 116 202 
077 
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Leopoldville 


Guam- 


Bangkok 


Sidney 





71 
71 
71 
71 
71 


76 
76 
76 
76 
76 
76 
90 


94 
100 
120 
130 
150 


25 
76 
76 
100 
123 
130 
90 


81 
56 
36 


44 
50 
55 


33 























070 


150 





169 


205 














098 





233 





102 


195 


223 








129} 256 


120 


214 


259 














154 
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Intervals between Periods of No Service 
in Certain Satellite Communication 
Systems — Analogy with a 
Traffic System 


By S. 0. RICE 
(Manuscript received May 11, 1962) 


In satellite systems in which the relative positions of satellites are al- 
lowed to vary, there will be periods during which no service will be pro- 
vided between given ground stations. Such periods are called “‘outages,”’ and 
the intervals between successive outages are called ‘“innages.” Here the 
outage and innage time distributions are studied with the help of an anal- 
ogy between a satellite system and a traffic system. The arrival of a cus- 
tomer in the traffic system corresponds to a satellite coming into view, and 
the service time of the customer corresponds to the time the satellite remains 
in view. In particular, the methods of analysis developed for traffic systems 
are applied to determine an approximation for the distribution of innage 
lengths. 


I. INTRODUCTION 


Several types of communication systems have been proposed which 
would use repeaters orbiting the earth as artificial satellites. The prob- 
lem considered in this paper arises in systems employing a number of 
satellites at altitudes of several thousand miles. Typically, the orbit 
altitude might be of the order of 5000 miles with a period of revolution 
of about 5 hours. 

The companion paper by Rinehart and Robbins! discusses the condi- 
tions under which a particular satellite will be visible to a given pair of 
ground stations. For the orbit altitudes considered here, the satellite 
will be visible intermittently. Conceivably the relative positions of the 
satellites could be maintained so that at least one satellite is mutually 
visible from the two ground stations at all times. However, at least for 
some of the early systems proposed, it is of interest to consider the case 


1671 


1672 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1962 


in which small differences in orbital period cause the relative positions 
of the satellites to vary with time. Attention is therefore directed to the 
statistical characteristics of satellite visibility. 

We are especially interested in those periods during which no satellite 
is available for communication between a given pair of ground stations. 
For convenience these events are called outages although, as pointed out 
by Rinehart and Robbins, these occurrences need not imply an inter- 
ruption of calls in progress. By analogy the intervals between outages 
are called innages. 

For any particular system the most effective method of obtaining 
outage information appears to be that of simulation. The course of the 
system for a year or more is computed, with the help of a high-speed 
digital computer, and the outages and innages recorded. [A method for 
doing this and some sample results are presented in the paper by Rine- 
hart and Robbins]. 

The present paper is concerned with determining the distribution of 
outage and innage lengths. The theory of outage length distribution has 
been studied by a number of people and some of their results, namely 
those which are needed here, are summarized in Section II. The traffic 
model is described in Section III. Sections IV and V contain results 
predicted by the model. In Section VI the predicted results are com- 
pared with those obtained by simulation. The work of Appendix B gives 
the basis for a model which is simpler but less accurate than the one 
described in Section III. The material in both appendices also appears 
to be of interest from the standpoint of traffic theory. 

The general conclusion is that, for the cases examined, both the out- 
age and innage distributions are approximately exponential. Their aver- 
ages are related by the rather simple equation (1). 


II. PRELIMINARY RESULTS 


First note that the average innage length ¢; and the average outage 
length ¢, (“a for innage and ‘“o” for outage) are connected by the re- 
lation 


= qd ox 
i; = ——— 1 
& (1) 
where gq is the quality of service; i.e., the fraction of time transmission 
is possible. This relation is always true and follows almost immediately 
from the definition of g. Typical values are q = 0.99 and ¢, = 0.25 hours; 


and it follows that the corresponding average innage length is t; = 
24.75 hours. 
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Next, let k be the number of satellites in the system. Consider a 
particular satellite. Every now and then it will pass over the region of 
mutual visibility. Let 


T = average length of time between its reappearances. For polar 
orbits, 7’ is the orbit time if the region of visibility includes 
one of the poles. 


b ' = average length of time the satellite is visible during one pass. 
The quantity b is a rate which occurs frequently in the analysis. 
p = fraction of time the satellite is visible. ‘rom these definitions 


it follows that 
b = pT. (2) 


The satellite systems considered here are restricted to those for which 
T and b ' are almost the same for all k satellites. 
With this notation we have 


q=1—(1—->p)’, (3) 

i. = (1 — p) T/k, (4) 

PG) < 1 = = O<t<s(l—-p)T (8) 
(hp) Fs) . 


where P(t) is the probability that the length of an outage will exceed £. 
Also the expected number of satellites visible at a given time is kp. 
These formulas are based upon the assumption that the phase angles of 
the satellites are independent random variables and are distributed uni- 
formly over the interval (0,27). Equation (3) is due to J. R. Pierce and 
R. Kompfner.” Equations (4) and (5) have been given by R. E. Mosher 
and R. I. Wilkinson, respectively, in unpublished memoranda. lor 
values of ¢ and k of interest, (5) may be approximated by 


P(t) = exp (— t/t). (6) 


It will be observed that the constants of the satellite system enter the 
right-hand sides of (3) to (5) only through the three parameters k,p,7’. 
Thus, as far as these formulas are concerned, the satellite system is 
specified by k,p,T’. 


III. TRAFFIC SYSTEM MODEL 


The satellite system will be represented by a traffic system model 
which consists of k independent servers, each having an average service 
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time b” and a service time distribution function B(t). Customers are 
supplied to the servers by k independent Poisson sources, each producing 
customers at an average rate a. When a source produces a customer, that 
particular source is removed from the system while the customer is 
being served. Thus, when 7 servers are busy serving n customers, the 
average arrival rate is (k — n) a. This type of input is a special case of a 
more general type (the limited source or “Iingset input’’?) which has 
been studied in traffic theory. 

The instant a satellite becomes visible from both the receiver and 
transmitter corresponds to the arrival of a customer. The length of 
time a satellite remains visible corresponds to the time required to serve 
a customer. After a customer has been served he leaves the system. This 
_ corresponds to the satellite leaving the region of mutual visibility. The 
state in which n satellites are visible simultaneously (state ) corre- 
sponds to the state in which n servers are busy serving n customers. An 
outage corresponds to an idle period (state 0), i.e., a period during which 
all servers are idle. An innage corresponds to a busy period, i.e., to a 
period when one or more servers are busy. 

Note that the constant orbit time of the satellite may introduce a 
regularity in the satellite arrivals. The traffic model has the shortcoming 
that there is no corresponding regularity in the customer arrivals. 

The analogy between the satellite system and the model is established 
by taking & and b to have the same values in both and setting 


a=1/(1—p)T. (7) 


To justify this choice for a, note that if a particular satellite is not visible 
at a time ¢ selected at random, the chance that it will become visible in 
t,t + dt is dt/(T’ — pT). Comparison with the corresponding proba- 
bility a dt for the traffic system gives (7). When the three satellite sys- 
tem parameters k,p,7’ are known, the three model parameters k, 0’, a 
follow at once from (7) and b* = pT. 


IV. OUTAGE AND QUALITY OF SERVICE PREDICTED BY MODEL 


The values of the quality of service g and average outage length ¢, 
predicted by the model agree exactly with (3) and (4) while the pre- 
dicted outage length distribution is the exponential approximation (6) 
to Wilkinson’s polynomial expression. A sketch of the proof of these 
statements is given in the following paragraphs. 

First assume the service distribution B(t) to be exponential, i.e., 
equal to 1 — e”’. Then since the sources are Poisson, the behavior of 
the system is governed by the k + 1 state equations (Ref. 3, p. 30) 
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Po = — kaPo + bP, 
Py = kaPo — |(k — 1l)a + bIPi + 20P2 
PJ = (k — 1)aP; — [(k — 2)a + 2b]P. + 3bP3 (3) 
Ps) = (k — 2)aP2 — [(k — 3)a + 3b]P3 + 4bP, 
Pil = aPya — kbP, 

where P,, = P,,(t) is the probability the system is in state n at time ¢ 


and primes denote time derivatives. The steady state probability p, 
that exactly n customers are present at a time picked at random is 


m= (Keim, m= baby (9) 


This follows upon setting the derivatives in (8) to zero, taking P,(t) = 
Dn, and solving the equations step by step. The expected number 7% of 
customers present is kab '/(1 + ab~’) and the average arrival rate is 


a(k — %) = ka/(1 + abd’). (10) 
The quality of service is now 
q=1-— Po (11) 


and expression (3) for g may be obtained by using a = 1/(1 — p)T, 
b-! = pT. Since, (2) an outage corresponds to state 0, (iz) state 0 can 
end only through an arrival, and (227) the arrivals in state 0 are Poisson 
with rate ka, it follows that exp (— kat) is the probability (predicted by 
the model) that the length of an outage will exceed ¢. This agrees with 
the exponential approximation (6), and the average value tf) = 1/(ke) 
agrees with (4). 

It is known that expression (9) for the steady state probability 
pn holds not only for exponential service but also for the general service 
distribution B(¢) (Ref. 3., p. 90). Hence the model predicts that expres- 
sions (3), (4) and (6) still hold when the length of time a particular 
satellite stays in view has an arbitrary distribution B(t). 


V. INNAGE DISTRIBUTION PREDICTED BY MODEL 


The average innage length (i.e., average busy period) predicted by 
the model when the service distribution B(t) is arbitrary follows from 
to = 1/ka and q = 1 — po: 
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ae = POs. 1G AP See 
0 





This much is easy. It is much more difficult to get the complete distribu- 
tion, as the following work shows. 

When the service lengths are exponentially distributed, the busy 
period distribution may be obtained by solving a “first passage” prob- 
lem. State 0 is made absorbing and the system is started with an arrival 
at time 0. Thus the system starts in state 1 at time 0 and jumps from 
state to state in accordance with the arrivals and departures of custom- 
ers. The system eventually lands in state 0 and stays there. This cor- 
responds to the end of the busy period or innage. 

When state 0 is made absorbing, the first two of the k + 1 state equa- 


tions (8) are replaced by 
P,! — bP, 
(12) 
PY = — [(k — 1l)a + bIP, + 2bP,. 


The modified equations (8) are to be solved subject to P,; = 1 and 
Po, P2, +++, Pi; = 0 at time 0. The probability that the length of an 
innage will exceed the length ¢ is 


G(t) =1— Pp. (13) 


Step-by-step computation of the derivatives of Po at t = O from the 
differential equations gives the power series 


bt. [(k -—1) at dle 
COS gt 21 
[(k —1)? a + 4(k — 1) ad + 0] be 
7 3! (14) 


+ [(k —1)' oa + (k — 1) (9k — 11) a? bd 








4 
+ 11(k — 1) al? +O] +. 


which is useful for small values of ¢. 

Since P, is determined by the last k differential equations of the 
modified set, and since the coefficients in these equations are constants, 
we may expect P; [and hence G(t)] to be expressible as the sum of k 
exponential terms. Indeed, when ¢,(s) is used to represent the Laplace 
transform of P,,(t), the k differential equations for P,(t), --- , P(t) go 
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into & linear equations for gi(s) , -- +, gx(s). By introducing the genera- 
ting function moe 

k 
(x, s) = roi(s) + --- + a°y,(s) 
in the usual way, the linear equations may be combined into 


d(x, s) 
£ 





byi(s) —~ 2 = (1 — x) (b+ az) — (steak — pet: >: 


To obtain ¢;(s), rewrite this equation as 


[by (s) = x _ a)? (b of: ai 


15 
= 2 (1 — 2)*(b + ax)**8(a, 8) a 
Ox 


where z = s/(a + b). Assume Re (z) > 0 and integrate (15) from x = 0 
to« = 1. The right-hand side vanishes because (0, s) is zero. Changing 
the variable of integration on the left-hand side from x to y = (1 — x)/ 
(b + az) gives | “oF 


1/b . yb e 
bls) [yA + ay)tdy = [= yy + ay) ay. 16) 


Expanding (1 + ay)* and integrating termwise shows that the coefficient 
of bgi(s) is b-7F;.(z) where 


P= p3} (*) a | (17) 


When the integrand on the right-hand side of (16) is replaced by its 
equivalent 





E + ee) 4 y* (1 + ay)* — LS y(1 + ay)* 
ka Meu ive 


(which is suggested by an integration by parts and adding and sub- 
tracting various terms) we obtain 


ba(s)Pile) = (1+ 2) male) — 2+ ay 
ka ka 
From P,’ = bP; it follows that sgo(s) = byg:(s) and hence 


= b(1 + abo) 
kaski(z)  ~ 


Ass > ©, F(z) tends to (1 + ab-!)"/z and go(s) is O(1/s). Writing 


1 L 
go(s) = aL es 
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¢o(s) as the sum of partial fractions, inverting to obtain Po(é), and using 
(13) gives the expression we seek: 


Q(t) = So OL + ab) exp [Ca + b)znil 


ma kaeml,! (2m) 


(18) 


where F’,’(z) = dF ,(z)/dz and 20, 2, °** , 2-1 are the zeros of F’,(z). 
These zeros lie between the poles at 0, —1, —2, --- , —k. When ¢ is 
large G(t) is given, effectively, by the term corresponding to m = 0 in 
(18). Usually zo is close to z = 0 and may be obtained by successive 
approximations from F(z) = 0, i.e., from 


1 + (k (ab *)” 
Zz =e (*) ztn- 


The foregoing method is the one originally used to obtain G(é) as the 
sum of exponential terms. Subsequently, a more elegant method of ob- 
taining (18) for G(t) was suggested by L. Takacs. His method removes 
the restriction that the service time distribution be exponential. 

Takacs’ result is the following: Let B(s), y(s) be the Laplace trans- 
forms of B’(t), —G’(t), the service time and busy period probability 
densities, respectively. The equation to determine y(s), given B(s), is 


1 “ —st k 

a ka = key (a) = I € [Paa(t)] dt (19) 
where Paa(é) has the Laplace transform 1/|s -+ a — a@(s)]. It will appear 
later that the subscript a refers to the idle state (state 0). The expres- 
sion (18) for G(t) may be obtained from (19) by starting with B(s) = 
b/(s + b). It turns out that P..(t) is given by [b + ae‘! '}/(b + a) 
so that the Laplace transform of [P.a(t)]* is not difficult to compute. 

One way to establish (19) is to regard the model as composed of k 
independent simple systems, each consisting of a source connected to a 
server. Consider a simple system. The lengths of the idle and busy 
periods have the respective probability densities a ¢ %, B’(t) with 
Laplace transforms a/(a + s), B(s). 

From (26) of Appendix A, the probability Paa(t) that an idle period 
will be in progress at time ¢, given that one is in progress at time 0, may 
be determined by inverting its Laplace transform 1/[s + a — aB(s)]. 
The probability that all & servers are idle at time t, given that they are 
idle at time 0, is [Paa(t)]’. Equation (19) now follows upon using (26) 
again. This time the type (a) intervals correspond to the periods (out- 
ages) during which all & servers are idle. The arrival rate is ak and pa(t) 
is ak ¢ “’. The type (b) intervals correspond to the innages with 
probability density —G’(t) and Laplace transform y(s). 
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VI. COMPARISON WITH SIMULATION 


It is interesting to compare the results predicted by the model with 
those obtained by simulation. As an example, we shall take results ob- 
tained by Rinehart and Robbins for a Maine-Western Europe link. This 
link was assumed to have 18 satellites in random polar orbits at a 
neight of 6,000 nautical miles. 

The positions of the orbits, the locations of the receiver and trans- 
mitter, and some computations involving a number of representative 
passages over the region of mutual visibility lead to 


1. the value 7’ = 6.35 hours for the average interval between reap- 
pearances of a particular satellite, 


2. the distribution function B(t) for 1, the length of time it is visible, 
3. an average value of | equal to b = 1.46 hours, 
4, the value p = b”'/T = 0.230 for the fraction of time the satellite 


is visible. 


It turns out that the probability density B’(¢) can be approximated by 
the rectangle 


| 0 , t < 0.86 hours 
B’(t) = <0.833, 0.86 < t < 2.06 (20) 
| 0 , 2.06 < t. 


The system was required to furnish a quality of service close to 0.99. 
This requirement together with the expression g = 1 — (1 — p)* and 
the value p = 0.230 gives k = 18 and gq = 0.99094. The model param- 
eters are therefore taken to be 


= 1 
( — : = Se 
k = 18, b 1.46 hours, a (i — p) 
1 (21) 
= aos = CO 
(0.770) (6.35) vee) 


with the understanding that p = 0.230. 

The values of q, t, ,t; obtained by simulation are compared with those 
predicted by the formulas of Section II (and also by the model) in 
Table I. 

The second column gives values obtained by Rinehart and Robbins 
by a simulation which followed the system for 18 months. The third 
column gives values computed from the model parameters (21). It is 
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TABLE [| 
Quantity Simulation Model, » = 0.230 Model, » = 0.234 
q 0.9918 0.99094 0.9918 
to 0.291 hours 1/ka = 0.271 0.270 
ti 35.1 hours ql—q)t. = 29.6 32.6 
bo} — 1.46 hours 1.48 








seen that the values of the average outage length ¢, agree fairly well. 
The discrepancy in ¢, reflects the shortcomings of the model. The last 
column shows what happens when we hold k and T at their former values 
of 18 and 6.35, and fudge the value of p so as to make q have the simula- 
tion value 0.9918. This changes p from 0.230 to 0.234 and ¢; from 29.6 
to 32.6. It. is seen that the value of t; is very sensitive to such changes. 

Tig. 1 shows three curves for the innage length distribution. The ordi- 
nate is G(t), the probability that an innage length will exceed ¢. Curve A 
is the curve predicted by the model assuming exponential service, and 
is obtained by substituting the parameter values (21) in expressions (14) 
and (18) for G(t). Curve B is the exponential approximation 


Coens = (22) 


with #; equal to the simulation value 35.1 hours. Curve C is the result 
obtained by simulation. During the 18 months simulated there were 122 
innages, the longest lying between 250 and 260 hours, the next longest 
between 190 and 200 hours, and so on. The average innage lengths cor- 
responding to curves A and C are #; = 29.6 and #; = 35.1, respectively, 
in agreement with the table above. 

The curves shown in Tig. 1 agree mdeestaly well. The agreement 
would be improved if curve A could: be shifted so as to give an average 
value of 35.1 instead of 29.6. Some of the discrepancy between curves A 
and C around t = 0 can be ascribed to the assumption of exponential 
service. Better agreement in this region could be obtained by taking the 
model to have the (almost) true service distribution (20). The proce- 
dure for doing this is indicated by (19) but the task of carrying through the 
work seems to be difficult. Again, it should be possible to use the “Erlang 
service” approximation 4b’t exp (—2bt) for B’(t) in (19) and also to 
solve the corresponding first passage problem [i.e., solve the equations 
corresponding to (8) and ( 12)|. However, this was not attempted. 

Some idea of the change produced in G(¢) when exponential service is 
replaced by other kinds of service may be obtained from Fig. 2. The 
curves of Fig. 2 show G(t) for the simplified model based upon the re- 
sults of Appendix B. Exponential service (curve D) and constant service 
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time (curve FE) are assumed, and Poisson arrivals are taken for both 
cases. The average arrival rate isa = k/T = 18/6.35 = 2.84 per hour 
and the average service time is Db’ = 1.46 hours. Substitution of these 
values in the expressions for G(t) given in examples (a) and (b) of 
Appendix B give curves D and E, respectively. 

The expanded scale at the top of Fig. 2 shows the behavior of G(¢) 
around ¢ = Q. Both distributions predict the same average innage 
length, namely 


i; = [e” * — 1Ja' = 22.0 hours (23) 


which is (43) in Appendix B. The discrepancy between 22.0 and the 
value t; = 29.6 given by the model of Section III shows the shortcomings 





A~PREDICTED BY MODEL OF SECTION 3 
tL = 29.6 HRS 





B-APPROXIMATION: G(t)= e-t/Ti 
WITH tj = 35.1 HRS 





C~SIMULATION - 18 MONTHS 
tL = 35.1HRS 








PROBABILITY THAT INNAGE LENGTH>t 





G(t)= 





TIME, t, IN HOURS 


Fig. 1 — Innage length distribution for 18-satellite system with random polar 
orbits (Maine-Western Europe). 
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TIME, t, IN HOURS 








E-CONSTANT SERVICE (ALL 
TIMES IN VIEW = 1.46HRS) 





AVERAGE ARRIVAL RATE 
= 2.84 PER HR. 


AVERAGE SERVICE TIME 
=1.46HRS 








G(t) = PROBABILITY THAT INNAGE LENGTH >t 





o 50 100 150 200 250 300 
TIME, t, IN HOURS 


Fig. 2— Innage length distributions. 


of the simplified model. Nevertheless, it appears that, the difference in 
shape between curves D and EF illustrates the change in G(t) produced 
by the different kinds of service. Support for this belief comes from the 
fact that curves A and D, both of which correspond to exponential 
service, have the same shape. 

In view of the inaccuracies of the models and of the relatively good 
agreement shown by the curves B and C of Fig. 1, it seems that the 
simple exponential approximation for G(¢) is quite good. 
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APPENDIX A 


Probabilities Associated with Alternating Sequences 


Iixpressions are recorded here for the Laplace transforms of two con- 
ditional probabilities. These transforms are of use in establishing Takacs’ 
result (19). 

Consider a sequence comprised of two kinds of intervals which alter- 
nate with each other (for example, innages and outages). Let pa(t), 
pv(t) be the probability densities for the lengths of the two types of 
intervals, and let their respective Laplace transforms be a(s), B(s). 
Suppose that the interval lengths are independent, let an interval of 
type (a) start at time 0, and let P..(t) be the probability that an interval 
of type (a) is in progress at time ¢. Then the Laplace transform of 
Paa(t) is 


2 —st 1 a a(s) 
é” Pat) di = ——_——.. 
J = ase) 
Similarly, if an interval of type (b) starts at time 0, the Laplace 
transform of the probability Py.(t) that an interval of type (a) is in 
progress at time ¢ is 


(24) 


( — a(s)|(s) 
fi — a(s)a(s)] ee 


These results are reminiscent of the relations between generating func- 
tions in the theory of recurrent events (Ref. 4, ch. 12). 


The expression (24) may be obtained by noting that 
Paa(t) = Pr(ta > t) + Pr(tar + tor + tas > t 
| and ta + ti <#) +--- 
= Pr(ta > t) + [Pr(ta + ta + tax > 6) 
— Pr(ta +t > t)J) + --: 


where ta, f1, faz, -°*: are the lengths of the successive (a) and (b) 
type intervals. The Laplace transforms of Pr(ta > t), Pr(ta + t1 > 
t), +--+ are [1 — a(s)]/s, [1 — a(s) B(s)]/s, +--+ , and summation gives 
(24) when | a(s) B(s) | < 1 (as it certainly is when Re(s) > 0). Expres- 
sion (25) may be obtained in a similar fashion or by convoluting p(t) 
and Paa(t). 

When p,(¢) = a exp (— at), the condition that a type (a) interval 
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start at time 0 may be replaced by the condition that a type (a) interval 
be in progress at time 0. In this case Paa(t) is also the probability that 
a type (a) interval is in progress at time ¢, given that one was in progress 
at time 0. From (24) and a(s) = a/(a +s), the Laplace transform of 
this probability is 


1 


s+ a— a8(s)’ 26) 


APPENDIX B 


Systems with an Infinite Number of Servers and Recurrent Inputs 


This appendix will be concerned with systems containing an infinite 
number of servers and a “recurrent”’ input, 1.e., an input in which the 
lengths of the intervals between successive arrivals are independent of 
each other and of the state of the servers. In many respects these sys- 
tems are simpler than the limited source input introduced in Section 
III. Although they do not represent the satellite system as well, their 
greater simplicity enables us to estimate the shape of the innage dis- 
tribution for cases which are difficult to handle by Takacs’ result (19). 

In the following list of results, A(¢) is the distribution function for 
the distances between the arrivals of the recurrent input and B(¢) is 
the service time distribution for each one of the infinite number of 
servers. The expected distance between arrivals is a’ and the expected 
service time is b’. 


B.1 The Conditional Probability Pio (t) 


Let all of the servers be idle at time —O and let the first customer 
arrive at time 0 making one server busy at +0. Denote by P:,(t) the 
conditional probability that n servers are busy at time ¢. Consideration 
of the first arrival following time +0 leads to an integral equation 
which (in theory) may, be solved for Pio(t), namely 


Py(t) = [1 — A(#)]B(t) + B(t) [ A'(t — v)Py(v) dv, (27) 


where A’(u) = dA(u)/du. A corresponding equation for the generating 
function for P;,(t) is given by (44). 

Example (a). For Poisson input 1 — A(é) is exp (— at). This cor- 
responds to an unlimited source input. Substituting in (27), multiplying 
through by [exp (at)]/B(t), and differentiating with respect to ¢ gives a 
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differential equation for Pyo(t). Using the fact that Pyo(é) > B(d) as 
t — 0 to fix the constant of integration leads to 


Py(t) = B(t) exp | -2 i fl — BOa)lear |, (28) 


a result given by Refs. 5 and 6. The work may be simplified by starting 
with the assumption that Pio(t) is of the form B(t)P(t). 

Example (b). For regularly spaced arrivals, A’(w) = 6(u — a’) where 
6(t) is the unit impulse. Equation (27) then gives 


Pyo(t) = B(t), 0<ti< a 
Py(t) = B(t)Pi(t — a“) oe 

- re _, (29) 
Piy(t) = B(t)B(t— a) a <t<2a 


Py(t) = B(OB(t — a")B(t — 2a") = 2a < t < Ba? 


and so on. 

Example (c). The ease when A(t) is arbitrary and 1 — B(t) = 
exp (— Jt) has been studied by Takacs’ (see also Ref. 3, p. 33 et seq.). 
Multiplying (27) by exp (— st), integrating ¢ from 0 to ~, and intro- 
ducing the Laplace transforms a(s), 0(s) of A’(t), Pio(t) leads to a re- 
currence relation between 6(s) and 6(s + 1) which in turn gives 


eerste ees ot deat ites SO, ees Ee | 
= aa aan ta) eo) NT RN tes 
As41As42 


: (oat ail—eaae 


where asin is written for a(s + n). When 6(s) is known P,(t) may be 
obtained by inversion. 





B.2 The Busy Period Distribution G(t) for Poisson Arrivals 


Let —G’(t) be the probability density for the lengths of the busy 
periods (corresponding to innages) and consider the case of Poisson 
input and arbitrary service. The Laplace transform y(s) of —G’(t) is 
given by 


(s + a)6(s) 
1 + ad(s) 


where 6(s) is the Laplace transform of Pio(t). 


y(s) = (31) 
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My original derivation of (81) was based on taking the Laplace trans- 
form of 


-4'() =f) af dsPolt — f(s) 
t 8 (32) 
+ a | ds Py(t — s) i dr Py(s — r)f(r) — +++, 


S(t) = Pw’ (t) + aPyo(t), (33) 


where Py(t) is given by (28) and f(t) dé is the probability that the 
system will jump from state 1 to state 0 in (t, ¢ + dt), given an arrival 
at time 0 which ends an idle period. The series (32) was obtained by 
an application of the method of inclusion and exclusion. Subsequently, 
Takacs obtained a formula equivalent to (31) by an elegant method based 
upon results of the type stated in Appendix A. When this method is 
applied to obtain (31) the type (a) intervals are taken to be idle periods 
(outages) with p.(t) = aexp (— at), a(s) = a/(a + s). The type (b) 
intervals become the busy periods so that ps(t), 8(s) become —G'’(t), 
vy(s), respectively; and Pya(t) goes into Pyo(t). Expression (25) of 
Appendix A then says that the Laplace transform of Pio(¢) is 


= v(s) 
6(s) = a ne (34) 


from which (31) follows. 
Example (a). For Poisson arrivals and exponential service with 
B=1-—e”, (28) becomes (Ref. 3, p. 26) 


Py(t) = (1 — &) exp[— p+ pe, = ab". 


The change of variable y = e” carries the integral for the Laplace 
transform of P(t) into 


bo(s) = F(z) — F(2 +1), z= sb” 

where 

1 

F(z) =e? i y” le” dy 

0 
ne ae sree p ae. 5 4 
z 2e+il)' 2ze+1)(z4+ 2) 
=4 = pe? 
zl — pPF(2 +1) =D 


nao N'i(z + n)- 


ll 
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The Laplace transform of G(¢) may be shown to be 


and inversion gives 


Gt) = ria Gy = ab (37) 
where F(z) = dF(z)/dz. The zeros of F(z) are real and (a) occur at 
20,21, 22,°°*, (b) aresuch that -1<24<0,-2<a< —-l,::-, 
etc. and hence lie between the poles of F(z) at 0, —1, —2, --- , (c) may 
be computed by successive approximations with the help of the last 
series for F(z) in (85). 

A power series for G(t) may be obtained by expanding [1 — y(s)]/s 
in powers of 1/s and then replacing s-"* by t"/n!. The same series may 
be obtained from the corresponding series (5.3) for the limited source 
case by letting k — », a — 0 in such a way as to keep ka equal to a. 
Replacing a by pb then gives 


2 2 3 
ety 24. 4 “ i. (p oa _@ t+ = 1) (bt) 


4 (9 +9 +11p +1) (bt!) 
4! 


(38) 


Example (b). For Poisson arrivals and constant service time, B(t) 
jumps from 0 to 1 at time t = b”’. Equation (28) shows that Pio(t) 
jumps from 0 to exp (— ab”) at ¢ = b*. The Laplace transforms are 
readily computed and the one for G(¢) gives 
1 cio [1 _ g ere nar 


G(t) = = 


Dri Join Sp genre 4, C0, (39) 


Taking c large enough, say c = b, to make |x| < 1 where z is as” 
exp [— (a + s)b™’] and expanding the denominator in powers of x leads 
to 


Git) =1+ d (—peryn| = at + ar". (40) 


(n — 1) !p n! 


Here, as in example (a), p = ab‘ and the upper limit of summation is 
determined from N < bt < N + 1. In particular, 
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G0) =1, Gib'—0)=1, Gb'+0)=1-e"° 
G(2b") = 1— (1+ p)e&” (41) 
G03) = 1 (1+ per Ho (L + Be 

For large values of f 
Oe (oo + p) ot (42) 


p(1 + oo) 


where go is the rightmost root of 


otpe’”’=0. 


When p is large oo is approximately — pe °. 


The innages corresponding to examples (a) and (b) have the same 
average length, namely 


ts = [e* — 1J/a. (43) 


To see this, note that from traffic theory, or by letting t > © in (28), 
the fraction of idle time is p = 1 — q = exp (— ab’). The average 
outage time is f& = 1/a, and (43) follows upon using the relation (1) 
between #;, é , and g. 


B.3 Miscellaneous Results for Recurrent Input 


Except for the case in which arrivals occur at multiples of some fixed 
spacing, Pio(t) approaches the steady state probability po as t > ~, 
and the Laplace transform 6(s) of Pyo(t) has a simple pole at s = 
with residue po . 

If Pyo(t) tends to a periodic function the residue gives its average 
value. Application of this result to the case of regularly spaced arrivals 
and exponential service with B(t) = 1 — e~’ leads to the rather curious 
expansion 
[I 


0 n= 


(La) dr 
0 


1 


1 1 
rs ok ei Ss l(2—1) |. 


Both sides represent the average value of the periodic function to which 
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P(t) tends as  —» ©. The integral on the left (with zx = exp a,x > 1) 
is the average value as computed from (29), while the series on the right 
is the residue of 0(s) at s = 0 obtained by setting a(s) = exp (— sa’) 
in (30) and letting s > 0. 

The generating function 


Pye; t) => > xv” Pin(t) 


for the conditional probability P1,(¢) that state n exists at time #, given 
that an arrival at time 0 ends an idle period, [see (27)] satisfies the 
integral equation 


Py(a,t) = {fe + (1 — x) B(t)] 
t 4 
ii — A(t) + | A’(t — v)P,(2, v) av oo 


A formal step-by-step solution may be obtained by introducing the 
binomial moments 1/,(t) defined by 


P,(a, t) 


Y (e = 1d), 


Me) => (“) Pa(t). 


k=n 


The value of 1/o(t) is one and the higher-order moments satisfy integral 
equations obtainable from (44). When the Laplace transforms of 
A’(t), B’(t), M(t) are denoted by a(s), B(s), un(s) it is found that 
the integral equations lead to 





Qe, z@= ae 
_ 1 fe? [1 p(s — 2) a(z) taa(z) 
unl) = Qt Jo—je s—Z 1 — a(s) ae teenly, M9) 


The singularities of a(z)u,»-1(z) are supposed to lie to the left and those 
of [1 — B(s — z)]/(s — 2) to the right of the path of integration, s 
being chosen so as to make this possible. In theory, the successive values 
of un(s) may be obtained step by step and thus ultimately lead to an 
expression for P,(z, ¢). For exponential service the integrals in (45) 
may be evaluated and lead to results given by Takacs’ (see also Ref. 3, 
p. 33). 
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